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AGS PROTEINS AND NUCLEIC ACID MOLECULES AND USES THEREFOR 



Background of the Invention 

Heterotrimeric G protein-mediated signal transduction is a tightly regulated 
5 event. All known G protein-coupled receptor (GPCR) mediated signaling pathways rely 
on multiple regulatory mechanisms in order to prevent inappropriate induction of the 
signal and to facilitate recovery during chronic stimulation (Gilman (19$7) Ann. Rev. 
Biochem. 56:615-649, reviewed in Simon et al. (1991) Science 252:802-808; Conklin 
and Bourne (1993) Cell 73:631-641; Neer (1995) Cell 80:249-257; Rens-Domiano and 

10 Hamm (1995) FASEBJ. 9:1059-1066). These regulatory mechanisms function at every 
level of the signaling cascade. Regulation of GPCR activation is believed to involve 
phosphorylation of the receptor C-terminus and subsequent receptor internalization 
(Palczewski and Benkovic (1991) Trends Biol. ScL 16:387-391; Goodman etal. (1996) 
Nature 383:447-450; Chen and Konopka (1996) Mol. Cell Biol 16:247-257), though 

15 this does not appear to be a universal mechanism (Daunt et al. (1 997) Mol Pharm. 

51 :71 1-720). Known mechanisms of regulation of signal transduction at the level of the 
heterotrimeric G protein include receptor-mediated facilitation of GTP/GDP exchange 
on Get (reviewed in Simon et al. (1991) Science 252:802-808; Conklin and Bourne 
(1993) Cell 73:631-641; Neer (1995) Cell 80:249-257; Rens-Domiano and Hamm 

20 (1995) FASEBJ, 9:1059-1066) and enhancement of the intrinsic GTPase activity of Got 
proteins by RGS-like proteins (reviewed in Berman and Gilman (1998) J. Biol Chem. 
273:1269-1272). Activation of PAKs, serine/threonine kinases that transduce signals 
from heterotrimeric G proteins to the MAP kinase cascade, has been shown to occur 
through interaction with either the small G proteins Cdc42 and Rac, or through 

25 interaction with heterotrimeric G proteins (reviewed in Sells and Chernoff (1997) Trends 
Cell Biol. 7:162-167). GPCR-coupled MAP kinase cascades and their downstream 
transcription factors, in turn, are regulated through phosphorylation/dephosphorylation 
cycles that may or may not require small G proteins (reviewed in Cobb and Goldsmith 
(1995)7. Biol Chem. 270:14843-14846). Non-receptor activators of G-proteins have 

30 also been identified. These include both protein activators (Strittmatter et al. ( 1 993) 
Proc. Nat'lAcad. ScL USA 90:5327-5331; Okamoto et al. (1995) J. Biol. Chem. 
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270:4205-4208; Sato et al. (1996) J. Biol. Chem. 271 .30052-30060) and non-protein 
activate* (summarized in Odagaki et al. (1998) Life Sciences 62:1537-1541. 

Even with the identification of these diverse regulatory systems, an in-depth 
understanding of the temporal and spatial regulation of GPCR mediated signaling 
remains elusive. In fact, cellular variations in the efficiency and/or specificity of 
coupling observed for many specific receptor-heterotrimeric G protein complexes 
suggest the presence of additional unidentified, cell-specific regulators of the signaling 
process. 



10 



Summary of the Invention 

In an attempt to identify novel factors involved in regulating scaling through 
GPCR-mediated signal transduction pathways, a screening system was devised in the 
yeast Saccharomyces cerevisiae designed to identify receptor-independent activators of 
the pheromone response pathway. These functional screens can be designed to target not 
15 only specific regulatory pathways in yeast, but also an introduced mammalian 

component or components.Yeast strains containing an intact signal transduction cascade 
but lacking a functional GPCR were made conditional for growth upon either 
Pheromone pathway activation (activator screen) or pheromone pathway inactivation 
(inhibitor screen). In addition, the activator yeast strain carries an integrated FUSlp- 
20 HIS3 construct, making histidine prototrophy conditional upon pheromone pathway 
activation. The inhibitor yeast strain carries an episomal FUSJp-CANI construct 
Adult human liver cDNAs expressed in a yeast vector under control of a galactose- 
inducible promoter were introduced into these strain, and those cDNAs that conferred 
growth in a galactose- and insert-dependent fashion were identified. Provided herein is 
25 the characterization of one fmember of these activator cDNAs, encoding a protein of 281 
amino acids with significant homology to ras-related G proteins and containing 
alterations in conserved amino acids consistent with a deficiency in OTP hydrolysis 
activity (i.e., a constitutive^ active form of ras-related G protein). Genetic epistasis 
tests in yeast were consistent with this activator fimctioning at the level of the 
30 heterotrimeric G-protein and facilitating GTP exchange on the chimeric G«i2. This 
protein is referred to herein as Activator of G protein Signaling, or "AGS", or 
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AGS1 AGS1 also shows Ga selectivity, as measured by growth assays in yeast 
expressing various mammalian Ga constructs, and tissue specific expression, as 
measured by Northern blot analysis. A cDNA product identified from the inhibitor 
screen encodes a previously identified regulator of G-protein signaling, human RGS5. 
5 In one embodiment, an isolated nucleic acid molecule of the present invention 

encodes an AGS protein (e.g., the nucleic acid molecule has a nucleotide sequence 
having at least 86% identity to the nucleotide sequence of SEQ ID NO: 1 or the 
complement thereof). In another embodiment, an isolated nucleic acid molecule of the 
present invention has a nucleotide sequence having at least 90% identity to the 
10 nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 3, or the complement of SEQ ID 
NO: 1 or SEQ ID NO: 3. In yet another embodiment, the isolated nucleic acid molecule 
has the nucleotide sequence of SEQ ID NO: 1, or the complement thereof In another 
embodiment, the isolated nucleic acid molecule has the nucleotide sequence of SEQ ID 
NO: 3, or the complement thereof. In yet another embodiment, an isolated nucleic acid 
1 5 molecule of the present invention encodes a protein that activates G protein-coupled 
signal transduction in a G protein-coupled receptor independent manner. 

In another embodiment, an isolated nucleic acid molecule of the present 
invention has a nucleotide sequence which encodes a protein having an amino acid 
sequence at least 97% identical to the amino acid sequence of SEQ ID NO: 2. In another 
20 embodiment, the isolated nucleic acid molecule encodes a protein having the amino acid 
sequence of SEQ ID NO: 2. In yet another embodiment, an isolated nucleic acid 
molecule of the present invention encodes a protein which activates G protein-coupled 
signal transduction in a G protein-coupled receptor independent manner. 

The present invention also provides vectors including nucleic acid sequences 
25 which encode all or a portion of an AGS protein as well as host cells including such 
vectors. The invention further provides methods for producing an AGS protein 
including culturing host cells which express an AGS protein. The invention also 
provides transgenic animals which contain cells carrying a transgene encoding AGS 
protein. 
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In another embodiment, the present mvention provides isolated AGS proteins 
(**. an .solated AGS protein having an ammo acid sequence at least 97% identity to the 
-no acid sequence of SEQ ID NO: 2.) In another embodiment, the protem has the 
ammo acid sequence of SEQ ID NO: 2. The present invention also provides fusion 
S Protems having at least a portion of an AGS protem operative.y linked to a non-AGS 
Polypept.de as well as antibodies that specifically bind to an AGS protein (e g . 
monoclonal or polyclonal anybodies). The invention further provides pharmaceutical 
composmons including AGS proteins or AGS-antibodies. 

modulatecel^^^ 

of (a) contacting a cell that expresses an AGS protein with a test compound- (b) 
determinmg the effect of the test compound on the activity of the AGS protem- and (c) 
.denying *e test compound as a modulator of signal transduction based on the abi % 
of the compound to modulate the activity of the AGS protein in the cell. In another 
5 embodiment, the AGS proteins utilized in the subject methods have an amino acid 
sequence which is at least 97% identical to SEQ ID NO: 2 and stimulates G protein 
-cuvny m a receptor-independent manner. In yet another embodiment, the AGS protein 
used m the subject methods has the amino acid sequence of SEQ ID NO: 2 

In yet another embodiment, the compound identified by the above method is a 
nucle. acd encoding a polypeptide capable of habiting the activitv of the AGS 
P-em. In still another embodiment, the above method ftrther comprises a nuc.eic acid 
encodmg an in^bitor of the AGS protem. Inapreferred embodiment, the abovemethod 
>s su-table for identifying a test compound capable of modulating the activity of the AGS 
protein by modulating the inhibitor of the AGS protein. 

In a preferred embodiment, cells used in the screening methods of the present 
mvenuon have been engineered to express the AGS protein. Preferred cells for use in 
he screening methods are yeast cell, ,„ another preferred embodiment, the yeast cells 

^-.^I^d^G^a^ Theactivityofatestcompound 
on a cell-associated activity { , g . , a G-protein mediated activity) can be determined by 
momtoring a pheromone response pathway in the yeast cells (,,., by measuring the 
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activity of a pheromone responsive promoter in the yeast cells), by monitoring the ability 
of the test compound to bind to the AGS protein, or by monitoring the ability of the test 
compound to modulate the interaction of the AGS protein with a target molecule (e.g., a 
G protein). 

5 The present invention further provides methods for modulating G protein- 

coupled signal transduction in a cell (e.g., by contacting a cell with an agent which 
modulates AGS protein activity or AGS nucleic acid expression). Methods for treating a 
subject having a disorder characterized by aberrant AGS protein activity or nucleic acid 
expression are also provided as well as methods for detecting the presence of AGS in a 

10 biological sample. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 



Brief Description of the Drawings 

15 Figures J A and IB depict diagrams of the yeast pheromone response pathway 

with major signaling components, (Abbr. indicated are: a, Gpal; p, Ste4; y, Stel8; P, 
phosphorylation) and the modifications made to the pheromone response pathway (Fig. 
IB) for the Activator screen (i.e., strains CY1316/1183) and the Inhibitor screen (e.g., 
strains C Y 1 1 4 1 / 1 45 1 - AGS 1 /2440). 

20 Figure 2 depicts the major steps in carrying out the yeast Activator (left panel) 

and Inhibitor (right panel) screens. 

Figures 3A and3B depict the coding region of the cDNA sequence (Fig. 3 A) 
and predicted amino acid sequence of human AGS (Fig. 3B). The nucleotide sequence 
corresponds to SEQ ID NO: 1. The amino acid sequence corresponds to SEQ ID NO:2. 

25 The composite insert region of cDNA clone pYES2-Ll is available from GenBank, 
Accession No. #AF069506. Consensus sequences for ras-like G proteins (Valencia, et 
al. (1991) Biochemistry 30:4637) are underlined. Regions unique to AGS1 are indicated 
in italics. Three residues that are normally conserved in ras-like G proteins but 
divergent in AGS 1 and Rndl-3 (Nobes, et al. (1998) J. Cell Biol. 141:187) are indicated 

30 with asterisks below the residue. These divergent residues were shown to confer 
GTPase deficiency in Rndl. 
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Figures 4A and 4B depict the alignment of AGS with representatives of all 
major classes of small G proteins in humans indicating that AGS is likely to be the 
founding member of a novel class of small G proteins in humans. 

Figure 5 depicts the alignment of residues m the P region and in the G' region of 
5 various small G proteins. Asterisks indicate the location of three highly conserved 

rescues that are altered ,„ AGS, Rndl , Rnd2, and Rndl Numbers indicate the positions 
of the amino acid sequence of AGS. 

Detailed Description » f the Invention 

0 The present invention is based on the discovery of nucleic acid and protein 

molecules, referred to herein as Activator of G protein Signaling ("AGS") protems and 
nucleic acid molecules, which play a role in or function in G protein-mediated signal 
transduction in the absence of receptor stimulation. These molecules were identified 
usmg an assay of the invention that employs yeast-based functional screens using the 
» Pheromone response pathway. In part , the assay relies on the observation that a G- 
protem coupled receptor (GPCR) signaling pathway is requir ed for mating in haploid 
yeast. Moreover, there is functional redundancy between this pathway and the 
mammalian GPCR signaling pathway and al, of the major signalmg components and 
regulatory systems in GPCR-mediated s.gnal transduction in mammalian systems 
appears to be conserved in the yeast pheromone response pathway (Figure 1 A) (Kurjan 
(1993)^ Re, Gene,. 27:147-179; Bardwell et al., De, Biol. 166:363-379) Thus 
thts assay can be used to search for mammalian regulators of this system. Normally ' 
upon receptor activation by pheromone, the GPCR associated heterotrimeric G-protein 
undergoes subunit dissociation into GTP-bound activated Ga (Gpal) and G P y 
(Ste4/Stel8). Free G P y dimer then transduces a signal through a P 2 1 -activated kinase 
(Ste20) to a MAP kinase cascade, leading to the transcriptional activation of mating- 
specific genes by the transcription factor Stel2, as well as Farl-mediated growth arrest 
» the Gl phase of the cell cycle. For the screening assay of the present inventions the 
pheromone response pathway was engineered to create yeast strains that could be made 
condmonal for growth upon either pheromone pathway activation or suppressjon ^ 
IB). Usmg these strains, functional screens were developed to identify mammalian 
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cDNAs whose expression either activates or down-regulates the yeast pheromone 
response pathway independent of the presence of receptor. A human AGS cDNA was 
isolated in a functional cloning screen in yeast based upon its ability to activate G 
protein signaling in a manner independent of G protein-coupled receptor stimulation. 

5 Genetic evidence (described in detail in Examples 1-3) indicates that this AGS- 

dependent activation occurs at the level of the heterotrimeric G protein. Thus, in one 
embodiment, the AGS molecules stimulate the activity of one or more G proteins 
involved in a G protein-mediated signal transduction pathway, e.g., a pheromone 
response cascade in yeast, to thereby activate G protein-mediated signal transduction 

10 independent of G protein coupled receptor stimulation. In a preferred embodiment, the 
AGS molecules of the present invention stimulate the activity of one or more G proteins 
involved in a G protein-mediated signal transduction pathway, such that G protein 
coupled receptor-mediated signal transduction is amplified. In a particularly preferred 
embodiment, the AGS molecules are capable of modulating the activity of Get subunits, 

15 such as a mammalian Gai2 subunit or a chimeric Ga subunit comprising a portion of the 
yeast Gpal protein (e.g., the amino-terminal 41 amino acids) linked to a mammalian 
Gai2 subunit. 

Since the AGS proteins of the invention can function in activation of the 
pheromone response cascade in yeast cells, and potentially modulate the MEK pathway 

20 in mammalian cells, the AGS molecules of the present invention can be used in methods 
for identifying antagonists of G protein signaling, either receptor-dependent or receptor 
independent, in screening assays in host cells, such as mammalian or yeast host cells. 

A particularly preferred AGS nucleic acid and protein, depicted in Figure 5 (and 
shown in SEQ ID NO: 1 and 2, respectively), is isolated from human cells. Figure 5 

25 depicts the nucleotide sequence of the coding region of an AGS cDNA which was 
isolated from a human liver cDNA library. An AGS cDNA nucleotide sequence that 
includes 5' and 3* untranslated regions is shown in SEQ ID NO: 3. The cDNA sequence 
encodes a predicted protein which is 281 amino acid residues in length and which 
exhibits homology to ras-related G proteins. The AGS protein also contains alterations 
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in amino ac.ds that typically are conserved among ras-related G proteins that are 
consistent with AGS having a deficiency in GTP hydrolysis activity. 

The AGS proteins of the invention contain several motifs characteristic of Ras 
superfamily members, including the phosphate/magnesium binding regions 
5 GXXXXGK(S/T)(SEQ ID NO: 18) (the P-loop) and DXXG (SEQ ID NO: 19) (the G' 
region), as well as the guanine base binding loops NKXD (SEQ ID NO: 20) and EXSAK 
(SEQ ID NO: 21) (Bourne et al. (1991) Nature 349:1 17-127; Valencia et al (1991) 
Biochemistry 30:4637-4648). Additionally, the motif regions G-l through G-5 
characteristic of GTPases (Bourne et al. (1991) Nature 349:1 1 7-127), are present in 
.0 AGS. Moreover, the C-terminal region of AGS has a typical CAAX (SEQ ID NO- 22) 
motif (Bourne et al. (,991) Nature 349:1 ,7-127; Valencia et al. (199.) Biochemistry 
30:4637-4648; Del Villar et al. (1996) Biochem. Soc. Trans. 24:709-7,3). The CAAX 
motif is immediately preceded by a basic region QAKDKER (SEQ ID NO- 23) thought 
to be important in anchoring ras-like G proteins to the phospholipid bilayer (Magee and 
15 Newman (1992) Trends Cell Biol. 2:3,8-323). 

Various aspects of the invention are described in further detail in the following 
subsections: 



20 I- Isolated N ucleic Acid Mnler mIp, 

One aspect of the invention pertains to isolated nucleic acid molecules that 
encode an AGS protein or biologically active portion thereof, as wel, as nucleic acid 
fragments sufficient for use as hybridization probes to identify AGS-encoding nucleic 
aad (e* , AGS mRNA). As used herein, the term "nucleic acid moiecule" is intended to 
25 include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e * 
mRNA) and analogs of the DNA or RNA generated using nucleotide analogs The 
nucleic acid mCecuie can be single-stranded or double-stranded, but preferably is 
double-stranded DNA. An "isolated" nucleic acid molecule is one which is separated 
from other nucleic acid molecules which are present in the natural source of the nucleic 
>0 ac.d. Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the 
nucleic acd (e.g., sequences located at the 5' and 3' ends of the nucleic acid) in the 
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genomic DNA of the organism from which the nucleic acid is derived. For example, in 
various embodiments, the isolated AGS nucleic acid molecule can contain less than 
about 5 kb, 4kb, 3kb, 2kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally 
flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid 
5 is derived. Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, 
can be substantially free of other cellular material, or culture medium when produced by 
recombinant techniques, or chemical precursors or other chemicals when chemically 
synthesized. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 

10 having the nucleotide sequence of SEQ ID NO:l or SEQ ID NO: 3, or a portion thereof, 
can be isolated using standard molecular biology techniques and the sequence 
information provided herein. For example, a human AGS cDNA can be isolated from a 
human liver cDNA library using all or portion of SEQ ID NO: 1 or SEQ ID NO: 3 as a 
hybridization probe and standard hybridization techniques (e.g., as described in 

15 Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 
2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989). Moreover, a nucleic acid molecule encompassing all or a 
portion of SEQ ID NO: 1 or SEQ ID NO: 3 can be isolated by the polymerase chain 
reaction using oligonucleotide primers designed based upon the sequence of SEQ ID 

20 NO: 1 or SEQ ID NO:3. For example, mRNA can be isolated from liver cells (e.g. , by 
the guanidinium-thiocyanate extraction procedure of Chirgwin et al. (1979) 
Biochemistry 1 8: 5294-5299) and cDNA can be prepared using reverse transcriptase 
(e.g., Moloney MLV reverse transcriptase, available from Gibco/BRL, Bethesda, MD; or 
AMV reverse transcriptase, available from Seikagaku America, Inc., St. Petersburg, FL). 

25 Synthetic oligonucleotide primers for PCR amplification can be designed based upon the 
nucleotide sequence shown in SEQ ID NO: 1 or SEQ ID NO:3. A nucleic acid of the 
invention can be amplified using cDNA or, alternatively, genomic DNA, as a template 
and appropriate oligonucleotide primers according to standard PCR amplification 
techniques. The nucleic acid so amplified can be cloned into an appropriate vector and 

30 characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding 



SUBSTITUTE SHEET (RULE 26) 



WO 99/58670 

PCT/US99/10J51 

- 10- 

to an AGS nucleotide sequence can be prepared by standard synthetic techniques . g 
using an automated DNA synthesizer. ' ' 

In a preferred embodiment, an isolated nucleic acid molecule of the invention 
compnses the nucleotide sequence shown in SEQ ID NO:,. The sequence of SEQ ID 
5 NO:l corresponds to the coding region of an AGS cDNA isolated from human liver 
cells. Another preferred AGS cDNA sequence is shown in SEQ ID NO- 3 which 
mcludes 5' and 3' untranslated regions. This cDNA comprises sequences encoding the 
AGS proton "the coding region", from nucleotides 154 to 999) as well as 5' 

,0 ~~ <W ' 0llta7te 153)and3 '— 

In another preferred embodiment, an isolated nucleic acid molecule of the 
■nvenuoncomprisesanucleic acid molecule which ,s a complement of the nucleotide 
sequence shown in SEQ ID NO: 1 or SEQ ID NO:3, or a portion of any of these 
nucleotide sequence, A nucleic acid molecule which is the complement of the 
■5 nucleo.de sequence shown in SEQ ID NO:l or SEQ ID NO:3 is one which has a 

nucleotide sequence that directly pai re with that of SEQ ID NO: , or 3, according to the 
rules of Watson and Crick base pairing, wherein A pairs with T and G pairs with C For 
example, the complement of the sequence 5' GGATGC 3' would be 3'CCTACG 5" 
(which, written in the standard 5' to 3' direction, would be 5' GCATCC 3') 
0 In still another preferred embodiment, an isolated nucleic acid molecule of the 

mventton comprises a nucleotide sequence which is at least 60%, preferably at least 
70o/o, more preferably at 80%, and even more preferably at least 90%, or 95% or 96% 
or 97%, or 98%, or 99% homologous to the nucleotide sequence show, in SEQ ID NO- , 
or SEQ ID NO:3, or a portion of any of these nucleotide sequences. In an additional 
pre erred embodiment, an isolated nucleic add molecule of the inven.on comprises a 
nuc leo.de sequence which hybridizes, , g . , hybridizes under stringent conditions to the 
nucleo.de sequence shown in SEQ ID NO:l or SEQ ID NO:3, or aportion of any of 
these nucleotide sequences. 

Mo^ver, a. „« Wc ^ molecule of ^ ^ ^ ^ ^ 

- be used as . probe or ^ or . ^ ^ ^ 
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an AGS protein. The nucleotide sequence determined from the cloning of the AGS 
genes from a mammal, e.g., humans, allows for the generation of probes and primers 
designed for use in identifying and/or cloning AGS homologues in other cell types, e.g. 
from other tissues, as well as AGS homologues from other mammals, e.g., mice, rats, 
5 etc. The probe/primer typically comprises substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
under stringent conditions to at least about 12, preferably about 25, more preferably 
about 40, 50 or 75 consecutive nucleotides of SEQ ID NO: 1 or SEQ ID NO:3 sense, an 
anti-sense sequence of SEQ ID NO:l or SEQ ID NO:3, or naturally occurring mutants 
10 thereof. Primers based on the nucleotide sequence in SEQ ID NO: 1 or SEQ ID NO:3 
can be used in PCR reactions to clone AGS homologues. Probes based on the AGS 
nucleotide sequences can be used to detect transcripts or genomic sequences encoding 
the same or homologous proteins. In preferred embodiments, the probe further 
comprises a label group attached thereto, e.g. the label group can be a radioisotope, a 
1 5 fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can be used as 
a part of a diagnostic test kit for identifying cells or tissue which misexpress an AGS 
protein, such as by measuring a level of an AGS-encoding nucleic acid in a sample of 
cells from a subject e.g., detecting AGS mRNA levels or determining whether a genomic 
AGS gene has been mutated or deleted. 
20 In one embodiment, the nucleic acid molecule of the invention encodes a protein 

or portion thereof which includes an amino acid sequence which is sufficiently 
homologous to an amino acid sequence of SEQ ID NO:2 such that the protein or portion 
thereof maintains the ability to modulate a G-protein mediated response in a cell. As 
used herein, the language "sufficiently homologous" refers to proteins or portions 
25 thereof which have amino acid sequences which include a minimum number of identical 
or equivalent (e.g., an amino acid residue which has a similar side chain as an amino 
acid residue in SEQ ID NO:2) amino acid residues to an amino acid sequence of SEQ ID 
NO:2 such that the protein or portion thereof is able to modulate a G-protein mediated 
response in a cell. In another embodiment, the protein is at least 60%, preferably at least 
30 70%, more preferably at least 80%, more preferably at least 90% and most preferably at 
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least 95%, or 96%, or 97% or QK% rtr ooo/ u 

. w/o, or 98/ 0) or 99% homologous to the entire amino acid 
sequence ofSEQ ID NO:2. 

Portions of prMe , ns encoded by the AGS ^ ac . d 

biologically active portion of AGS" is intended to include a portion . 

^rnotif of AOS tba, „as one or more of , h e f„„ olvi „ 6 acivitie's: „' i, can ta 

WIIn ( e g-, bind to) a G protein; 2) it can modulate the activity of a 0 protein 1 3) it can 

bind ,„) a G protein ,a,ge, molecu,, 4, i, can m„du,a,e tne acnvity of 
» C protetn ^ ^ „ „ _ modulaie a Q proKin . medjattd * 

«„, of 0 p rott i„-c 0upW recepl0 , aM , vauon; md 4) m • 
«* « preceptor slgIlaIillg by modu , aling . „ ^ ^ _ 

S »nd M »i„ din8 ^ , lmlllnnoprecipita , ions ^ ^ , d ^ 
co iumn mndmg ^ M desC[jbed ^ ^ ^ ^ ^ 
. ty of an AGS P ro B ,„ or a b , ologically Mive portio „ ^ 

b,»d B ,a 0 p ro ^ Tode.rminewhcmeranAGSproteinorbioiogicaUyactive 
portion ^of can modulate G.protein mediated K p« in . ceU , „, AGg ^ ^ 

lansfected, wn.cn - ^„ entered ,„ g™ 0 „| y , lhe _ rf . ^ 
- biological* activepornon tneteof, e. s „ yeast ce„ strain I3I „ ,„ (dcsciib ^ , he 
- «■ tne abi,i,y of ,be AGS protein or oi^y active ^ W J" 

grow* dete^cd. Actively, a cei, can „e Wormed effected wtn 
G-protein mediated signal ftansducrion responsive reporter construe, FUS1- 
■ucferase, wnicb responds ,„ G-protein medial signaling b y expressing , ci W ^ 
a nucleic acid encoding ,„e AGS protein or biologically active portion tlof. 

* -^^»d,ysed M drepor ttr »„vi We ,,, U c,fe r asc M iv it y,c Mbe 

««d and compared to reports activity in a control cc„. Exa mp,es of control ce„s 

eells wnich include to G-p role ,„ mediated signal transduction responsive 
reporter construct. An alteration in reporter activity in to C e„ s which include nuc|eic 

nuc letc acid encoding tne AGS protein is indicative of a modulation of a G-p^ein 
mediated response in the ceil. 
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In addition to the AGS nucleotide sequences shown in SEQ ID NO:l and SEQ 
ID NO:3, it will be appreciated by those skilled in the art that DNA sequence 
polymorphisms that lead to changes in the amino acid sequences of AGS may exist 
within a population. Such genetic polymorphism in the AGS gene may exist among 
5 individuals within a population due to natural allelic variation. As used herein, the terms 
"gene" and "recombinant gene" refer to nucleic acid molecules comprising an open 
reading frame encoding an AGS protein, preferably a mammalian AGS protein. Such 
natural allelic variations can typically result in 1-5% variance in the nucleotide sequence 
of the AGS gene. Any and all such nucleotide variations and resulting amino acid 
10 polymorphisms in AGS that are the result of natural allelic variation and that do not alter 
the functional activity of AGS are intended to be within the scope of the invention. 
Moreover, nucleic acid molecules encoding AGS proteins from other species, and thus 
which have a nucleotide sequence which differs from the sequence of SEQ ID NO:l or 
SEQ ID NO:3, are intended to be within the scope of the invention. For example, non- 
15 human homologues of the AGS cDNAs of the invention can be isolated based on their 
homology to the AGS nucleic acid disclosed herein using the human cDNA, or a portion 
thereof, as a hybridization probe according to standard hybridization techniques under 
stringent hybridization conditions. Accordingly, in another embodiment, an isolated 
nucleic acid molecule of the invention is at least 15 nucleotides in length and hybridizes 
20 under stringent conditions to the nucleic acid molecule comprising the nucleotide 

sequence of SEQ ID NO:l or SEQ ID NO:3. In other embodiments, the nucleic acid is 
at least 30, 50, 100, 250 or 500 nucleotides in length. As used herein, the term 
"hybridizes under stringent conditions" is intended to describe conditions for 
hybridization and washing under which nucleotide sequences at least 60% homologous 
25 to each other typically remain hybridized to each other. Preferably, the conditions are 
such that sequences at least about 65%, more preferably at least about 70%, and even 
more preferably at least about 75% or more homologous to each other typically remain 
hybridized to each other. Such stringent conditions are known to those skilled in the art 
and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. 
30 (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent hybridization 

conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
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isolated nucleic acid molecule comprises a nucleotide sequence encoding a protein, 
wherein the protein comprises an amino acid sequence at least about 60% homologous to 
the amino acid sequence of SEQ ID NO:2, and is capable of modulating a G-protein 
mediated response in a cell. Preferably, the protein encoded by the nucleic acid 
5 molecule is at least about 70% homologous to SEQ ID NO:2, more preferably at least 
about 80-85% homologous, even more preferably at least about 90-95% homologous, 
and most preferably at least about 95, 96, 97, 98, or 99% homologous to SEQ ID NO:2. 

To determine the percent homology of two amino acid sequences or of two 
nucleic acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can 
10 be introduced in the sequence of a first or second sequence for optimal alignment). In a 
preferred embodiment, the length of a reference sequence aligned for comparison 
purposes is at least 30%, preferably at least 40%, more preferably at least 50%, even 
more preferably at least 60%, and even more preferably at least 70%, 80%, or 90% of 
the length of the reference sequence. The amino acid residues or nucleotides at 
15 corresponding amino acid positions or nucleotide positions are then compared. When a 
position in the first sequence is occupied by the same amino acid residue or nucleotide as 
the corresponding position in the second sequence, then the molecules are homologous 
at that position (e.g., as used herein amino acid or nucleic acid "homology" is equivalent 
to amino acid or nucleic acid "identity"). The percent homology between the two 
20 sequences is a function of the number of identical positions shared by the sequences 
(e.g., % homology = # of identical positions/total # of positions x 100). This number 
can be modified if using a alignment algorithm which allows for gaps. Alternatively, a 
percent similarity between two sequences can be determined wherein a non-identical 
pair of, for example, amino acids sharing a position are evolutionary conserved. Such a 
25 comparison can be made utilizing an algorithm which relies on a residue weight table, 
for example, a PAM residue weight table (e.g., PAM 120 or PAM 150) or a BLOSUM 
weight residue table (e.g. BLOSUM 62). 

The comparison of sequences and determination of percent homology between 
two sequences can be accomplished using a mathematical algorithm. A preferred, non- 
30 limiting example of a mathematical algorithm utilized for the comparison of sequences 
is the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 87:2264-68, 
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MegAlign* sequence alignment software. When utilizing the Wilbur-Lipmann 
algorithm, a K-tuple of 1 , a GAP penalty of 3, a window of 5, and diagonals saved set to 
= 5 can be used. Multiple alignment can be performed using the Clustal algorithm. 

An isolated nucleic acid molecule encoding an AGS protein homologous to the 

5 protein of SEQ ID NO:2 can be created by introducing one or more nucleotide 

substitutions, additions or deletions into the nucleotide sequence of SEQ ID NO:l or 
SEQ ID NO:3, such that one or more amino acid substitutions, additions or deletions are 
introduced into the encoded protein. Mutations can be introduced into SEQ ID NO.l or 
SEQ ID NO:3, by standard techniques, such as site-directed mutagenesis and PCR- 

10 mediated mutagenesis. Preferably, conservative amino acid substitutions are made at 
one or more predicted non-essential amino acid residues. A "conservative amino acid 
substitution" is one in which the amino acid residue is replaced with an amino acid 
residue having a similar side chain. Families of amino acid residues having similar side 
chains have been defined in the art. These families include amino acids with basic side 

15 chains (e.g., lysine, arginine ? histidine), acidic side chains (e.g., aspartic acid, glutamic 
acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, 
threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 
(e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 

20 phenylalanine, tryptophan, histidine). Thus, a predicted nonessential amino acid residue 
in an AGS protein is preferably replaced with another amino acid residue from the same 
side chain family. Alternatively, in another embodiment, mutations can be introduced 
randomly along all or part of an AGS coding sequence, such as by saturation 
mutagenesis, and the resultant mutants can be screened for an AGS activity described 

25 herein to identify mutants that retain AGS activity. Following mutagenesis of SEQ ID 
NO:l or SEQ ID NO:3, the encoded protein can be expressed recombinantly and the 
activity of the protein can be determined using, for example, assays described herein. 

In addition to the nucleic acid molecules encoding AGS proteins described 
above, another aspect of the invention pertains to isolated nucleic acid molecules which 

30 are antisense thereto. An "antisense" nucleic acid comprises a nucleotide sequence 
which is complementary to a "sense" nucleic acid encoding a protein, e.g., 
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complementary to the coding strand of a double-stranded cDNA molecule or 
complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense nucleic add can be 
complementary to an entire AGS coding strand, or to on.y a portion thereof. In one 
5 embod.ment, an antisense nucleic acid molecule is antisense to a "coding region" of the 
codmg strand of a nucleotide sequence encoding an AGS protein. The term "coding 
regton" refers to the region of the nucleotide sequence comprising codons which are 
translated into amino acid residue, In another embodiment, the antisense nucleic acid 
molecule is antisense to a "noncoding region" of the coding strand of a nucleotide 
'0 sequence encoding AGS. The term "noncoding region" refers to 5' and 3' sequences 

to as 5 and 3' untranslated regions). 

Given the coding strand sequences encoding AGS proteins disclosed herein (e g 
SEQID NO:l and SEQ ID NO: 3), antisense nucleic acids of the invention can be 
« deigned according to the rules of Watson and Crick base pairing. The antisense nucleic 
ac ld molecule can be complementary to the entire coding region of an AGS mRNA but 
more preferably is an oligonucleotide which » antisense to only a portion of the coding 
ornoncodmgregionofanAGSmRNA. For example, the antisense oligonucleotide can 
be complementary to the region surrounding the translation start site of an AGS mRNA 
20 An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25 30 35 40 45 
or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
constructed using chemical synthesis and enzymatic ligation reacts using procedures 
known m the art. For example, an antisense nucleic acid (e.g., an antisense 
ohgonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
vanously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 
antrsense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
substituted nucleotides can be used. 

Alternatively, the antisense nucleic acid can be produced biologically using an 
30 expression vector into which a nucleic acid has been subcloned in an antisense 

orientation (e.g., RNA transcribed from the inserted nucleic acid wil, be of an antisense 
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orienlation to a target nucleic acid of interest, described further in the following 
subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 

5 and/or genomic DNA encoding an AGS protein to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. An example of a route of 
administration of an antisense nucleic acid molecule of the invention includes direct 
injection at a tissue site. Alternatively, an antisense nucleic acid molecule can be 
modified to target selected cells and then administered systemically. e.g. The antisense 

10 nucleic acid molecule can also be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the antisense molecules, vector 
constructs in which the antisense nucleic acid molecule is placed under the control of a 
strong pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention 

15 is an a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual (J-units, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids. 
Res. 1 5:6625-6641). The antisense nucleic acid molecule can also comprise a 2-o- 
methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a 

20 chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327-330). 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have 
a complementary region. Thus, ribozymes (e.g. , hammerhead ribozymes (described in 

25 Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave 
AGS mRNA transcripts to thereby inhibit translation of AGS mRNAs. A ribozyme 
having specificity for an AGS-encoding nucleic acid can be designed based upon the 
nucleotide sequence of an AGS cDNA disclosed herein, (see, e.g., Cech et al. U.S. 
Patent No. 4,987,071 and Cech et al. U.S. Patent No. 5,1 16,742). Alternatively, AGS 

30 mRNA can be used to select a catalytic RNA having a specific ribonuclease activity 
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from a pool of RNA molecule, (see, Bartel, D. and Szostak, J.W. (,993) Science 
261:1411-1418). 

Alternatively, AGS gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of an AGS gene ( e g. an AGS 
5 P^-and/orenha^^^ 

an AGS gene in target cells. See generally, Helene, C. (1991) Anticancer Drug Des 
6(6):569-84; Helene, C. et al. (1992) An, N. Y. Acad. Sci. 660:27-36; and Maher, L J 
( 1 992) Bioassays 1 4( 1 2):807- 1 5. 

10 Hi-Rgconi bjnant Expression Vecto rs and Host Cell. 

Another aspect of the invention pertains to vectors, preferably expression 
vectors, containing a nucleic acid encoding an AGS protein (or a portion thereof) As 
used herein, the term "vector" refe, to a nucleic acid molecule capable of Sporting 
another nucleic acid to which it has been linked. One type of vector is a "plasmid" 
15 which refers to a circular double stranded DNA loop into which additional DNA ' 
segments can be ligated. Another type of vector is a viral vector, wherein additional 
DNA segments can be ligated into the vira, genome. Certain vectors are capable of 
autonomous replication in a host cell into which they are introduced (e.g. bacterial 
vectors having a bacterial origin of replication and episoma. mammalian vectors) Other 
20 vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host 
cell upon introduction into the host cell, and thereby are replicated along with the host 
genome. Moreover, certain vectors are capable of directing the expression of genes to 
which they are operatively linked. Such vectors are referred to herein as "expression 
vectors". In general, expression vectors of utility in recombinant DNA techniques are 
25 often ,n the form of plasmids. In the present specification, "plasmid" and "vector" can 
be used interchangeably as the plasmid is the most commonly used form of vector 
However, the invention is intended to include such other forms of expression vectors 
such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- 
assocated viruses) as well as baculoviral vectors, which serve equivalent functions 
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The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell, which 
means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, which is 

5 operatively linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host cell). The term "regulatory 

10 sequence" is intended to includes promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel; Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990).e.g. It will be appreciated by those skilled in the 
art that the design of the expression vector can depend on such factors as the choice of 

1 5 the host cell to be transformed, the level of expression of protein desired, etc. The 
expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., AGS proteins, mutant forms of AGS proteins, fusion proteins, 
etc.). 

20 Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, 

D.B. and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, 
MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase 
(GST), maltose E binding protein, or protein A, respectively, to the target recombinant 
protein. In one embodiment, the coding sequence of an AGS protein is cloned into a 

25 pGEX expression vector to create a vector encoding a fusion protein comprising, from 
the N-terminus to the C-terminus, GST-thrombin cleavage site-AGS protein. The fusion 
protein can be purified by affinity chromatography using glutathione-agarose resin. 
Recombinant AGS protein unfused to GST can be recovered by cleavage of the fusion 
protein with thrombin. 
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Examples of suitable inducible non-fusion E coli expression vectors include 
pTrc (Amann et al., (1988) Gene 69:301-315) and P ET 1 Id (Studier et al., Gene 
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego 
California (1990) 60-89). One strategy to maximize recombinant protein expression in 
5 E coh is to express the protein in a host bacteria with an impaired capacity to 
proteolytically cleave the recombinant protein (Gottesman, S., Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 
1 19-128). Another strategy is to alter the nucleic acid sequence of the nucleic acid to be 
inserted into an expression vector so that the individual codons for each amino acid are 
0 those preferentially utilized in E coli (Wada et al. (1992) Nucleic Acids Res. 20:21 11- 
2118). Such alteration of nucleic acid sequences of the invention can be carried out by 
standard DNA synthesis techniques. 

In another embodiment, an AGS expression vector is a yeast expression vector 
Examples of vectors for expression in yeast S. cerivisae include pYepSecl (Baldari et 
> al., (1987) Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, (1 982) Cell 30-933-943) 
PJRY88 (Schultz et al., (1987) Gene 54:1 ,3-123), and P YES2 (Invitrogen Corporation ' 
San Diego, CA). 

Alternatively, an AGS protein can be expressed in insect cells using baculovirus 
expression vectors. Baculovirus vectors available for expression of proteins in cultured 
msect cells (e.g., S f 9 cells) include the P Ac series (Smith et al. (1 983) Mol. Cell Biol. 
3:2156-2165), P BlueBacHis2 (Invitrogen Corporation, San Diego, CA), and the pVL 
series (Lucklow and Summers (1989) Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include P cDNA3. 1/His (Invitrogen Corporation, San Diego CA) 
pCDM8 (Seed, B. (1 987) Nature 329:840) and pMT2PC (Kaufman et al (1987) 
EMBOJ. 6:187-195). When used in mammalian cells, the expression vector's control 
fcnoons are often provided by vira, regulatory elements. For example, commonly used 
promoters are derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 
40. For other suitable expression systems for both prokaryotic and eukaryotk cells see 
chapters 16 and 17 of Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: 
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A Laboratory Manual. 2nd ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989. 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 
5 (e.g. , tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al. 
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) 
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and 
10 Baltimore (1989) EMBOJ. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters 
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) PNAS 86:5473-5477), 
pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), and mammary 
gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4,873,316 and 
15 European Application Publication No. 264,1 66). Developmentally-regulated promoters 
are also encompassed, for example the murine hox promoters (Kessel and Gruss (1990) 
Science 249:374-379) and the a-fetoprotein promoter (Campes and Tilghman (1989) 
Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a 
20 DNA molecule of the invention cloned into the expression vector in an antisense 

orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in 
a manner which allows for expression (by transcription of the DNA molecule) of an 
RNA molecule which is antisense to an AGS mRNA. For a discussion of the regulation 
of gene expression using antisense genes see Weintraub, H. et al., Antisense RNA as a 
25 molecular tool for genetic analysis, Reviews - Trends in Genetics, Vol. 1(1) 1986. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such terms 
refer not only to the particular subject cell but to the progeny or potential progeny of 
30 such a cell. A host cell can be any prokaryotic or eukaryotic cell. For example, AGS 
protein can be expressed in bacterial cells such as E. coli, insect cells, yeast or 
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mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other 
suitable host cells are known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
5 "transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al. (Molecular Cloning: A 
) Laboratory Manual. 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G418, 
hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be 
introduced into a host cell on the same vector as that encoding AGS or can be introduced 
on a separate vector. Cells stably transfected with the introduced nucleic acid can be 
identified by drug selection (e.g., cells that have incorporated the selectable marker gene 
will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (e.g., express) an AGS protein. Accordingly, the 
invention further provides methods for producing AGS proteins using the host cells of 
the invention. In one embodiment, the method comprises culturing the host cell of 
invention (into which a recombinant expression vector encoding an AGS protein has 
been introduced) in a suitable medium until AGS protein is produced. In another 
embodiment, the method further comprises isolating AGS protein from the medium or 
the host cell. 
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The host cells of the invention can also be used to produce nonhuman transgenic 
animals. The nonhuman transgenic animals can be used in screening assays designed to 
identify agents or compounds, e.g., drugs, pharmaceuticals, etc., which are capable of 
ameliorating detrimental symptoms of selected disorders such as cardiovascular 

5 disorders and proliferative disorders. For example, in one embodiment, a host cell of the 
invention is a fertilized oocyte or an embryonic stem cell into which AGS-coding 
sequences have been introduced. Such host cells can then be used to create non-human 
transgenic animals in which exogenous AGS sequences have been introduced into their 
genome or homologous recombinant animals in which endogenous AGS sequences have 

10 been altered. Such animals are useful for studying the function and/or activity of AGS 
and for identifying and/or evaluating modulators of AGS activity. As used herein, a 
"transgenic animal" is a nonhuman animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes a 
transgene. Other examples of transgenic animals include nonhuman primates, sheep, 

15 dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA which is 
integrated into the genome of a cell from which a transgenic animal develops and which 
remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 
used herein, a "homologous recombinant animal" is a nonhuman animal, preferably a 

20 mammal, more preferably a mouse, in which an endogenous AGS gene has been altered 
by homologous recombination between the endogenous gene and an exogenous DNA 
molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior 
to development of the animal. 

A transgenic animal of the invention can be created by introducing AGS- 

25 encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g. , by 
microinjection, retroviral infection, and allowing the oocyte to develop in a 
pseudopregnant female foster animal. The AGS cDNA sequence of SEQ ID NO:l can 
be used as a transgene. Alternatively, a non-human homologue of the AGS genes can be 
isolated based on hybridization to the AGS cDNA {e.g., hybridization to SEQ ID NO:l, 

30 described further in subsection I above) and used as a transgene. Intronic sequences and 
polyadenylation signals can also be included in the transgene to increase the efficiency 
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of expression of the transgene. A tissue-specific regulatory sequence^) can be operably 
linked to an AGS transgene to direct expression of AGS protein to particular cells. 
Methods for generating transgenic animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become conventional in the art and are 
5 described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by Leder et 
al., U.S. Patent No. 4,873,191 by Wagner et al. and in Hogan, B., Manipulating the 
Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 
1986). Similar methods are used for production of other transgenic animals. A 
transgenic founder animal can be identified based upon the presence of an AGS 
transgene in its genome and/or expression of AGS mRNA in tissues or cells of the 
animals. A transgenic founder animal can then be used to breed additional animals 
carrying the transgene. Moreover, transgenic animals carrying a transgene encoding 
AGS can further be bred to other transgenic animals carrying other transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains 
at least a portion of an AGS gene into which a deletion, addition or substitution has been 
introduced to thereby alter, ^.functionally disrupt, the AGS gene. The AGS gene can 
be a human gene {e.g., SEQ ID NO:l), but more preferably, is a nonhuman homologue 
of a human AGS gene {e.g., a murine homologue). For example, a mouse AGS gene 
be used to construct a homologous recombination vector suitable for altering an 
endogenous AGS gene in the mouse genome. In a preferred embodiment, the vector i 
designed such that, upon homologous recombination, the endogenous AGS gene i. 
functionally disrupted {e.g., no longer encodes a functional protein; also referred to 
"knock out" vector). Alternatively, the vector can be designed such that, upon 
homologous recombination, the endogenous AGS gene is mutated or otherwise altered 
but still encodes functional protein {e.g. , the upstream regulatory region can be altered to 
thereby alter the expression of the endogenous AGS protein). In the homologous 
recombination vector, the altered portion of the AGS gene is flanked at its 5' and 3' ends 
by additional nucleic acid of the AGS gene to allow for homologous recombination to 
occur between the exogenous AGS gene carried by the vector and an endogenous AGS 
gene in an embryonic stem cell. The additional flanking AGS nucleic acid is of 
sufficient length for successful homologous recombination with the endogenous gene. 
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Typicaily, several kilobases of flanking DNA (both at the 5' and 3' ends) are included in 
the vector (see e.g., Thomas, K.R. and Capecchi, M. R. (1987) Cell 5 1 :503 for a 
description of homologous recombination vectors). The vector is introduced into an 
embryonic stem cell line (e.g., by electroporation) and cells in which the introduced 

5 AGS gene has homologously recombined with the endogenous AGS gene are selected 
(see e.g., Li, E. et al. (1992) Cell 69:915). The selected cells are then injected into a 
blastocyst of an animal (e.g., a mouse) to form aggregation chimeras (see e.g., Bradley, 
A. in Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, E.J. 
Robertson, ed. (IRL, Oxford, 1987) pp. 1 13-152). A chimeric embryo can then be 

10 implanted into a suitable pseudopregnant female foster animal and the embryo brought 
to term. Progeny harboring the homologously recombined DNA in their germ cells can 
be used to breed animals in which all cells of the animal contain the homologously 
recombined DNA by germline transmission of the transgene. Methods for constructing 
homologous recombination vectors and homologous recombinant animals are described 

15 further in Bradley, A. (1991) Current Opinion in Biotechnology 2:823-829 and in PCT 
International Publication Nos.: WO 90/1 1354 by Le Mouellec et al.; WO 91/01 140 by 
Smithies et al.; WO 92/0968 by Zijlstra et al.; and WO 93/04169 by Berns et al. 

In another embodiment, transgenic nonhumans animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 

20 example of such a system is the cre/loxP recombinase system of bacteriophage PI. For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) PNAS 
89:6232-6236. Another example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 251:1351-1355. If 
a cre/loxP recombinase system is used to regulate expression of the transgene, animals 

25 containing transgenes encoding both the Cre recombinase and a selected protein are 
required. Such animals can be provided through the construction of "double" transgenic 
animals, e.g., by mating two transgenic animals, one containing a transgene encoding a 
selected protein and the other containing a transgene encoding a recombinase. Clones of 
the nonhuman transgenic animals described herein can also be produced according to the 

30 methods described in Wilmut, I. et al. (1 997) Nature 385:8 10-813. 
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III. Isola ted AGS Proteins and Anti- AGS Antibodies 

Another aspect of the invention pertains to isolated AGS proteins, and 
biologically active portions thereof, as well as peptide fragments suitable for use as 
immunogens to raise anti- AGS antibodies. An "isolated" or "purified" protein or 
biologically active portion thereof is substantially free of cellular material when 
produced by recombinant DNA techniques, or chemical precursors or other chemicals 
when chemically synthesized. The language "substantially free of cellular material" 
includes preparations of AGS protein in which the protein is separated from cellular 
components of the cells in which it is naturally or recombinant^ produced. In one 
embodiment, the language "substantially free of cellular material" includes preparations 
of AGS protein having less than about 30% (by dry weight) of non- AGS protein (also 
referred to herein as a "contaminating protein"), more preferably less than about 20% of 
non- AGS protein, still more preferably less than about 10% of non- AGS protein, and 
most preferably less than about 5% non- AGS protein. When the AGS protein or 
biologically active portion thereof is recombinantly produced, it is also preferably 
substantially free of culture medium, e.g., culture medium represents less than about 
20%, more preferably less than about 10%, and most preferably less than about 5% of 
the volume of the protein preparation. The language "substantially free of chemical 
precursors or other chemicals" includes preparations of AGS protein in which the protein 
is separated from chemical precursors or other chemicals which are involved in the 
synthesis of the protein. In one embodiment, the language "substantially free of 
chemical precursors or other chemicals" includes preparations of AGS protein having 
less than about 30% (by dry weight) of chemical precursors or non- AGS chemicals, 
more preferably less than about 20% chemical precursors or non- AGS chemicals, still 
more preferably less than about 10% chemical precursors or non- AGS chemicals, and 
most preferably less than about 5% chemical precursors or non- AGS chemicals. In 
preferred embodiments, isolated proteins or biologically active portions thereof lack 
contaminating proteins from the same animal from which the AGS protein is derived. 
Typically, such proteins are produced by recombinant expression of, for example, a 
human AGS protein in a nonhuman cell. 
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An isolated AGS protein or a portion thereof of the invention can modulate a G- 
protein mediated response in a cell. In preferred embodiments, the protein or portion 
thereof comprises an amino acid sequence which is sufficiently homologous to an amino 
acid sequence of SEQ ID NO:2 such that the protein or portion thereof maintains the 

5 ability to modulate a G-protein mediated response in a cell. The portion of the protein 
is preferably a biologically active portion as described herein. In another preferred 
embodiment, the AGS protein has an amino acid sequence shown in SEQ ID NO:2. In 
yet another preferred embodiment, the AGS protein has an amino acid sequence which is 
encoded by a nucleotide sequence which hybridizes, e.g., hybridizes under stringent 

10 conditions, to the nucleotide sequence of SEQ ID NO:l or SEQ ID NO:3. In still 
another preferred embodiment, the AGS protein has an amino acid sequence which is 
encoded by a nucleotide sequence that is at least 60%, preferably at least 70%, more 
preferably at least 80%, and even more preferably at least 90%, or 95%, or 96%, or 97%, 
or 98%, or 99% homologous to the nucleotide sequence of SEQ ID NO:l or SEQ ID 

15 NO: 3. The preferred AGS proteins of the present invention also preferably possess at 
least one of the AGS activities described herein. For example, a preferred AGS protein 
of the present invention includes an amino acid sequence encoded by a nucleotide 
sequence which hybridizes, e.g., hybridizes under stringent conditions, to the nucleotide 
sequence SEQ ID NO: 1 or SEQ ID NO: 3 and which can modulate a G-protein mediated 

20 response in a cell. 

In other embodiments, the AGS protein is substantially homologous to the amino 
acid sequence of SEQ ID NO:2 and retains the functional activity of the protein of SEQ 
ID NO:2 yet differs in amino acid sequence due to natural allelic variation or 
mutagenesis, as described in detail in subsection I above. Accordingly, in another 

25 embodiment, the AGS protein is a protein which comprises an amino acid sequence 

which is at least 60%, preferably at least 80%, and more preferably at least 86%, or 88%, 
or 90%, and most preferably at least 95%, or 96%, or 97%, or 98%, or 99% homologous 
to the entire amino acid sequence of SEQ ID NO:2 and which has at least one of the 
AGS activities described herein. In other embodiments, the invention pertains to a full 

30 length human protein which is substantially homologous to the entire amino acid 
sequence of SEQ ID NO:2. 
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Biologically active portions of the AGS protein include peptides comprising 
ammo acid sequences derived from the amino acid sequence of the AGS protein e g. 
the amino acid sequence shown in SEQ ID NO:2 or the amino acid sequence of 1 protein 
homologous to the AGS protein, which include less amino adds than the foil length 
5 AGS protein or the foil length protein which is homologous to the AGS protein and 
exlnbit at least one activity of the AGS protein. Typ.ca.ly, biologically active portions 
(Wades,*.*, peptides which are, for example, 5, 10, 15,20,30,35,36,37,38,39 40 
50, 100 or more amino acids in length) comprise a domain or motif, with at least one ' 
activity of the AGS protein. In a preferred embodiment, the biologically active portion 
0 of the protem includes a motif selected from the following: the phosphate/magnesium 
bmding reg 10 ns GXXXXGK(S/T)(SEQ ID NO: 18) (the P-looo) and DXXG (SEQ ID 
NO: 19), the guanine base binding loops NKXD (SEQ ID NO: 20) and EXSAK (SEQ 
ID NO: 21) the motif regions G-l through G-5, characteristic of GTPases, the C-terminal 
CAAX (SEQ ID NO: 22) motif, and/or the QAKDKER motif (SEQ ID N0 23) and can 
• modulate the activity of a G-protein. Moreover, other biologically active portions in 
which other regions of the protein are deleted, can be prepared by recombinant 
techniques and evaluated for one or more of the activities described herein. Preferably 
the biologically active portions of the AGS protein include one or more selected 
domains/motifs or portions thereof having biological activity. 

AGS proteins are preferably produced by recombinant DNA techniques For 
example, a nucleic acid molecule encoding the protein is cloned into an expression 
vector (as described above), the expression vector is introduced into a host cell (as 
described above) and the AGS protein is expressed in the host cell. The AGS protein 
can then be isolated from the cells by an appropriate purification scheme using standard 
protem purification techniques. Alternative to recombinant expression, an AGS protein 
polypeptide, or peptide can be synthesized chemically using standard peptide synthesis ' 
techniques. Moreover, native AGS protein can be isolated from cells, for example using 
an anti- AGS antibody (described further below). 

The invention also provides AGS chimeric or fusion proteins. As used herein an 
AGS "chimeric protein" or "fusion protein" comprises an AGS polypeptide operatively 
hnked to a non- AGS polypeptide. An " AGS polypeptide" refers to a polypeptide 
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having an amino acid sequence corresponding to AGS, whereas a "non- AGS 
polypeptide" refers to a polypeptide having an amino acid sequence corresponding to a 
protein which is not substantially homologous to the AGS protein, e.g., a protein which 
is different from the AGS protein and which is derived from the same or a different 

5 organism. Within the fusion protein, the term "operatively linked" is intended to 

indicate that the AGS polypeptide and the non- AGS polypeptide are fused in-frame to 
each other. The non- AGS polypeptide can be fused to the N-terminus or C-terminus of 
the AGS polypeptide. For example, in one embodiment the fusion protein is a GST- 
AGS fusion protein in which the AGS sequences are fused to the C-terminus of the GST 

10 sequences. Such fusion proteins can facilitate the purification of recombinant AGS. In 
another embodiment, the fusion protein is an AGS protein containing a heterologous 
signal sequence at its N-terminus. In certain host cells (e.g., mammalian host cells), 
expression and/or secretion of AGS can be increased through use of a heterologous 
signal sequence. 

1 5 Preferably, an AGS chimeric or fusion protein of the invention is produced by 

standard recombinant DNA techniques. For example, DNA fragments coding for the 
different polypeptide sequences are ligated together in-frame in accordance with 
conventional techniques, for example by employing blunt-ended or stagger-ended 
termini for ligation, restriction enzyme digestion to provide for appropriate termini, 

20 filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 

undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can 
be synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor 
primers which give rise to complementary overhangs between two consecutive gene 

25 fragments which can subsequently be annealed and reamplified to generate a chimeric 
gene sequence (see, for example, Current Protocols in Molecular Biology, eds. Ausubel 
et al. John Wiley & Sons: 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). An AGS- 
encoding nucleic acid can be cloned into such an expression vector such that the fusion 

30 moiety is linked in-frame to the AGS protein. 
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The present invention also pertains to homologues of the AGS proteins which 
function as either an AGS agonist (mimetic) or an AGS antagonist. In a preferred 
embodiment, the AGS agonists and antagonists stimulate or inhibit, respectively, a 
subset of the biological activities of the naturally occurring form of the AGS proiein. 
5 Thus, specific biological effects can be elicited by treatment with a homologue of 
limited function. In one embodiment, treatment of a subject with a homologue having a 
subset of the biological activities of the naturally occurring form of the protein has fewer 
side effects in a subject relative to treatment with the naturally occurring form of the 
AGS protein. 

0 Homologues of the AGS protein can be generated by mutagenesis, e.g., discrete 

point mutation or truncation of the AGS protein. As used herein, the term "homologue" 
refers to a variant form of the AGS protein which acts as an agonist or antagonist of the 
activity of the AGS protein. An agonist of the AGS protein can retain substantially the 
same, or a subset, of the biological activities of the AGS protein. An antagonist of the 
AGS protein can inhibit one or more of the activities of the naturally occurring form of 
the AGS protein, by, for example, competitively binding to a G-protein or downstream 
or upstream member of the pheromone response cascade. 

In an alternative embodiment, homologues of the AGS protein can be identified 
by screening combinatorial libraries of mutants, e.g. , truncation mutants, of the AGS 
protein for AGS protein agonist or antagonist activity. In one embodiment, a variegated 
library of AGS variants is generated by combinatorial mutagenesis at the nucleic acid 
level and is encoded by a variegated gene library. A variegated library of AGS variants 
can be produced by, for example, enzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such that a degenerate set of potential AGS 
sequences is expressible as individual polypeptides, or alternatively, as a set of larger 
fusion proteins (e.g., for phage display) containing the set of AGS sequences therein. 
There are a variety of methods which can be used to produce libraries of potential AGS 
homologues from a degenerate oligonucleotide sequence. Chemical synthesis of a 
degenerate gene sequence can be performed in an automatic DNA synthesizer, and the 
synthetic gene then ligated into an appropriate expression vector. Use of a degenerate 
set of genes allows for the provision, in one mixture, of all of the sequences encoding the 
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desired set of potential AGS sequences. Methods for synthesizing degenerate 
oligonucleotides are known in the art (see, e.g., Narang, S.A. (1983) Tetrahedron 39:3; 
Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; 
Ike et al. (1983) Nucleic Acid Res. 1 1:477. 

5 In addition, libraries of fragments of the AGS protein coding can be used to 

generate a variegated population of AGS fragments for screening and subsequent 
selection of homologues of an AGS protein. In one embodiment, a library of coding 
sequence fragments can be generated by treating a double stranded PCR fragment of an 
AGS coding sequence with a nuclease under conditions wherein nicking occurs only 

10 about once per molecule, denaturing the double stranded DNA, renaturing the DNA to 
form double stranded DNA which can include sense/antisense pairs from different 
nicked products, removing single stranded portions from reformed duplexes by treatment 
with SI nuclease, and ligating the resulting fragment library into an expression vector. 
By this method, an expression library can be derived which encodes N-terminal, C- 

15 terminal and internal fragments of various sizes of the AGS protein. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of AGS 

20 homologues. The most widely used techniques, which are amenable to high through-put 
analysis, for screening large gene libraries typically include cloning the gene library into 
replicable expression vectors, transforming appropriate cells with the resulting library of 
vectors, and expressing the combinatorial genes under conditions in which detection of a 
desired activity facilitates isolation of the vector encoding the gene whose product was 

25 detected. Recursive ensemble mutagenesis (REM), a new technique which enhances the 
frequency of functional mutants in the libraries, can be used in combination with the 
screening assays to identify AGS homologues (Arkin and Yourvan (1992) PNAS 
59:781 1-7815; Delgrave et al. (1993) Protein Engineering 6(3):327-33 1). 

An isolated AGS protein, or a portion or fragment thereof, can be used as an 

30 immunogen to generate antibodies that bind AGS using standard techniques for 

polyclonal and monoclonal antibody preparation. The ftill-length AGS protein can be 
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— or. alternatively, the invention provtdes m ,ig«„i c peptide „ f AQS for 

as tmmunogens. The antigenic peptide of A0S Mmprises „ ^ , ^ ^ ^ 
of the ammo acid sequence show , in g E Q , D N0;2 ^ encompassK m ej) . ^ ^ 
sue tha, an antibody raised apins , ^ ^ foms a specifc ^ ^ 

AGS. P"^bly, t hea nl , 8 ™ cp ep, i decon,pH se sa t leas,,0ami„„ac,d resilll , es more 
prefer a. leas, 15 amino add residues, even more preferably a, leas, 20 amino'acid 
residues, and most preferably a, leas, 30 amino acid residues. p refe „, d 
encompassed by ,he antigenic peptide are region of AGS that are located on the surface 
ofthe proton, hydrophilic regions and/or regions , hat are unique „ A0S t , nol 
10 common to all small G proteins, ' 

An AGS immunogen typically is used to prepare antibodies by burning a 
-table subject, , rabbi, goat, mouse o, other mamma,, with the tmmunogen An 

immunogenic preparation can contain, for sample, recombinantly expressed 
AGSprotcnorachem.ca.lysynmesizedAGSpeptidc. ^preparation can ftaher 
mclude an adjuvant, such as Frcund's compiete o, tncomptae adjuvant, „ r similar 

~* ■—*» *» -bjec, win, an immunogemc 

AGS preparation induces .polyclonal and- AGS antibody response. 

Accordingly, another aspect of the invention pertains to a„,i. A0S antibodies 
The term -antibody as used herein refers to immunoglobulin molecules and 
20 ,mmun„|„gi ca ,, v active portions of immunoglobulin molecules, , s . , moieculcs mat 
contam an antigen binding site which specially binds (immunoreacts with, an antigen 
such as AGS. Examples of immunologically active porttons of immunoglobulin 
n.o,ecu,es include F(ab, and F,ab', 2 fragment which can be genenaed by treating me 
antibody with m enz vme SU c„ as pepsin. The invention p,„v,des polyclonal and 
» antibodies that bind AGS. The term "monoclonal ™,ibody 0 r -monoclonal 

™,body condition-, as UK d herein, refers to a population of antibody molecules tha, 
ontatn only one species of an antigen binding site capable of immunoreacting with a 
particular epitope of ACS. A monoclonal arttibody composition thus typically displays, 
angle btnding affinity for a particular AGS protein with which i, immunoreacts 
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Polyclonal anti- AGS antibodies can be prepared as described above by 
immunizing a suitable subject with an AGS immunogen. The anti- AGS antibody titer 
in the immunized subject can be monitored over time by standard techniques, such as 
with an enzyme linked immunosorbent assay (ELISA) using immobilized AGS. If 
5 desired, the antibody molecules directed against AGS can be isolated from the mammal 
(e.g., from the blood) and further purified by well known techniques, such as protein A 
chromatography to obtain the IgG fraction. At an appropriate time after immunization, 
e.g., when the anti- AGS antibody titers are highest, antibody-producing cells can be 
obtained from the subject and used to prepare monoclonal antibodies by standard 
10 techniques, such as the hybridoma technique originally described by Kohler and Milstein 
(1975) Nature 256:495-497) (see also, Brown et al. (1981) J. Immunol. 127:539-46; 
Brown et al. (1980)./. Biol. Chem .255:4980-83; Yeh et al. (1976) PNAS 76:2927-31; 
and Yeh et al. (1982) Int. J. Cancer 29:269-75), the more recent human B cell 
hybridoma technique (Kozbor et al. (1983) Immunol Today 4:72), the EBV-hybridoma 
15 technique (Cole et al. (1985), Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 
Inc., pp. 77-96) or trioma techniques. The technology for producing monoclonal 
antibody hybridomas is well known (see generally R. H. Kenneth, in Monoclonal 
Antibodies: A New Dimension In Biological Analyses, Plenum Publishing Corp., New 
York, New York (1980); E. A. Lemer (1981) Yale J. Biol. Med. 54:387-402; M. L. 
20 Gefter et al. (1977) Somatic Cell Genet. 3:231-36). 

Any of the many well known protocols used for fusing lymphocytes and 
immortalized cell lines can be applied for the purpose of generating an anti- AGS 
monoclonal antibody (see, e.g., G. Galfre et al. (1977) Nature 266:55052; Gefter et al. 
Somatic Cell Genet., cited supra; Lerner, Yale J. Biol. Med, cited supra; Kenneth, 
25 Monoclonal Antibodies, cited supra). Hybridoma cells producing a monoclonal 

antibody of the invention are detected by screening the hybridoma culture supematants 
for antibodies that bind AGS, e.g. , using a standard ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a 
monoclonal anti- AGS antibody can be identified and isolated by screening a 
30 recombinant combinatorial immunoglobulin library (e.g. , an antibody phage display 
library) with AGS to thereby isolate immunoglobulin library members that bind AGS. 
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1559); Morrison, S. L. (\n5)Science 229:1202-1207; Oi et al. (1986) BioTechniques 
4:214; Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature 321:552-525; 
Verhoeyan et al. (1988) Science 239:1534; and Beidler et al. (1988) 1 Immunol. 
141:4053-4060. 

5 An anti- AGS antibody (e.g., monoclonal antibody) can be used to isolate AGS 

by standard techniques, such as affinity chromatography or immunoprecipitation. An 
anti- AGS antibody can facilitate the purification of natural AGS from cells and of 
recombinantly produced AGS expressed in host cells. Moreover, an anti- AGS antibody 
can be used to detect AGS protein (e.g., in a cellular lysate or cell supernatant) in order 

10 to evaluate the abundance, pattern of expression, and/or subcellular localization of the 
AGS protein. Anti- AGS antibodies can be used diagnostically to monitor protein levels 
in tissue as part of a clinical testing procedure, e.g., to, for example, determine the 
efficacy of a given treatment regimen. Detection can be facilitated by coupling (e.g., 
physically linking) the antibody to a detectable substance. Examples of detectable 

1 5 substances include various enzymes, prosthetic groups, fluorescent materials, 

luminescent materials, bioluminescent materials, and radioactive materials. Examples of 
suitable enzymes include horseradish peroxidase, alkaline phosphatase, p-galactosidase, 
or acetylcholinesterase; examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 

20 umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 

dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin, and aequorin, and examples of suitable radioactive material include 
125 I, 131 I, 35 Sor 3 H. 

25 

IV. Pharmaceutical Compositions 

The AGS nucleic acid molecules, AGS proteins, AGS modulators, and anti- AGS 
antibodies (also referred to herein as "active compounds") of the invention can be 
incorporated into pharmaceutical compositions suitable for administration to a subject, 
30 e.g., a human. Such compositions typically comprise the nucleic acid molecule, protein, 
modulator, or antibody and a pharmaceutical^ acceptable carrier. As used herein the 
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polyetheylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 

5 antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, 
ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the injectable compositions can be 
brought about by including in the composition an agent which delays absorption, for 

10 example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active compound 
(e.g., an AGS protein or anti- AGS antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required, 
followed by filtered sterilization. Generally, dispersions are prepared by incorporating 

15 the active compound into a sterile vehicle which contains a basic dispersion medium and 
the required other ingredients from those enumerated above. In the case of sterile 
powders for the preparation of sterile injectable solutions, the preferred methods of 
preparation are vacuum drying and freeze-drying which yields a powder of the active 
ingredient plus any additional desired ingredient from a previously sterile-filtered 

20 solution thereof 

Oral compositions generally include an inert diluent or an edible carrier. They 
can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 

25 using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutical^ 
compatible binding agents, and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and the like can contain any of the 
following ingredients, or compounds of a similar nature: a binder such as 

30 microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant 
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to be treated; each unit containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 

5 and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be inserted into vectors and used 
as gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for 
example, intravenous injection, local administration (see U.S. Patent 5,328,470) or by 

10 stereotactic injection (see e.g., Chen etal. (1994) 91:3054-3057). The 

pharmaceutical preparation of the gene therapy vector can include the gene therapy 
vector in an acceptable diluent, or can comprise a slow release matrix in which the gene 
delivery vehicle is imbedded. Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. retroviral vectors, the pharmaceutical 

15 preparation can include one or more cells which produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

V. Uses and Methods of the Invention 

20 An AGS protein of the invention has one or more of the activities described 

herein and can thus be used, for example, to screen drugs or compounds which modulate 
AGS protein activity as well as to treat disorders characterized by insufficient or 
excessive production of AGS protein or production of AGS protein forms which have 
altered (e.g., increased or decreased) activity compared to wild type AGS. In addition, 

25 methods are provided that employ AGS molecules and rely on strategies based upon 
functional readouts using the yeast Saccharomyces cerevisiae, and these methods offer a 
fast and more reliable way to identify genes whose expression affects key aspects of 
cellular function. Accordingly, these methods have distinct advantages over 
transcriptional profiling screens, computer-based screens, screens based on protein- 

30 protein interactions, and other transgenic and cell-based functional screens that have 
been developed in higher eukaryotes (Spence, (1998) Drug Disc. Today 3:179-188; 
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Simonsen eta,., (1994) Trends Pharmacol. Sci. 15:437-441; Evans et a,., (1997) Trends 
Gene, 13:370-374; Hicks et a,., (, 99 7) Na, Genet. 16:338-344; and Whitney et al 
(1998) M* Biotech. .6:1329-1333). Moreover, the isolated nucle.c ac.d molecules' of 
the invention can be used to express AGS protein (e.g., via a recombinant expression 
5 vector ,n a host cell in gene therapy applications), to detect AGS mRNA (e g. in a 
biological sample) or a genetic lesion in an AGS gene, and to modulate AGS activity as 
descnbed further below. Moreover, the ami- AGS anybodies of the invention can be' 
used to detect and isolate AGS protein and modulate AGS protem activity. 

0 a - Drug Screening Assays : 

The invention provides methods for identifying compounds or agents which 
modulate AGS protein activity and/or AGS nucleic acid expression. These methods are 
also referred to herein as drug screening assays and typically include the step of 
screening a candidate/test compound or agent for the ability to interact with (. g. bind 
» to) an AGS protein, to modulate the interaction of an AGS protein and a target molecule 
and/or to modulate AGS nucleic acid expression and/or AGS protein activity, and/or to ' 
modulate signal transduction mediated at least in part by an AGS protein. Candidate/test 
compounds or agents which have one or more of these abilities can be used as drugs to 
treat borders characterized by aberrant or abnormal AGS protein activity and/or AGS 
nucleic acid expression. Candidate/test compounds such as small molecules eg. small 
orgamc molecules, and other drug candidates can be obtained, for example, from 
combinatorial and natural product libraries. 

In a preferred embodiment, the invention provides a method for identifying a compound 
that modulates signal transduction in a cell, comprising the steps of contacting a cell that 
expresses an AGS protein with a test compound and determining the effect of the test 
compound on the activity of the AGS protein and identifying the test compound as a 
modulator of signal transduction based on the ability of the compound to modulate the 
activity of the AGS protein in the cell. 

As used herein, the term "identify" as used in the context of "identifying a 
compound" refers to the identification of compounds for which an activity as an 
activator of G protein signaling has not been previously recognized or demonstrated 
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The term "identifying" according to the methods of the present invention is intended to 
refer to identifying, screening and/or selecting of test compound, for example selecting 
active compounds not previously recognized as activators of G protein signaling for 
further analysis and testing. In a preferred non-limiting example, compounds 

5 "identified" according to the methods of the present invention can be used as test 

compounds in a second assay to confirm a G protein activating activity. Alternatively, 
compounds "identified" according to the methods of the present invention can be tested 
for other desirable activities or can be tested, for example, at varying doses to determine 
the efficacy of the compound. Compounds "identified" according to the methods of the 

10 present invention can be also tested in cell culture models or in animal models of 
disease. 

In a preferred embodiment, the AGS protein comprises an amino acid sequence 
having at least 86% identity to SEQ ID NO: 2 and stimulates G protein activity in a 
receptor-independent manner. In a particularly preferred embodiment, the AGS protein 
15 comprises the amino acid sequence of SEQ ID NO 2. Alternatively, the AGS protein 
can comprise a structure as described above in the sections discussing AGS proteins and 
nucleic acids. 

In a preferred embodiment, of the cell-based screening method, the cell has been 
engineered to express the AGS protein by introducing into the cell an expression vector 

20 encoding the AGS protein. The cell can be further engineered to express other proteins, 
such as a G protein a subunit. In a preferred embodiment, the cell is a yeast cell that has 
been engineered to express an AGS protein and a mammalian or chimeric G protein a 
subunit and the effect of the test compound on the activity of the AGS protein is 
determined by monitoring a pheromone response pathway in the yeast cells. A preferred 

25 chimeric G protein subunit which the yeast cell is engineered to express is a Gpal -Gai2 
chimeric G protein a subunit (preferably comprising 41 amino-terminal amino acids 
from yeast Gpal operatively linked to a mammalian Gai2). The pheromone response 
pathway in the yeast cells can be monitored, for example, by measuring the activity of a 
pheromone responsive promoter in the yeast cells. Yeast cell compositions and methods 
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that can be used for screening modulators of AGS proteins are described in further detail 
below and in the Examples. 
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a complex which includes the AGS protein and the target molecule or detecting the 
interaction/reaction of the AGS protein and the target molecule. Detection of complex 
formation can include direct quantitation of the complex by, for example, measuring 
inductive effects of the AGS protein. A statistically significant change, such as a 

5 decrease, in the interaction of the AGS and target molecule (e.g. , in the formation of a 
complex between the AGS and the target molecule) in the presence of a candidate 
compound (relative to what is detected in the absence of the candidate compound) is 
indicative of a modulation (e.g., stimulation or inhibition) of the interaction between the 
AGS protein and the target molecule. Modulation of the formation of complexes 

10 between the AGS protein and the target molecule can be quantitated using, for example, 
an immunoassay. 

To perform the above drug screening assays, it may be desirable to immobilize 
either AGS or its target molecule to facilitate separation of complexes from 
uncomplexed forms of one or both of the proteins, as well as to accommodate 

1 5 automation of the assay. Interaction (e.g. , binding of) of AGS to a target molecule, in 
the presence and absence of a candidate compound, can be accomplished in any vessel 
suitable for containing the reactants. Examples of such vessels include microtitre plates, 
test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein can be 
provided which adds a domain that allows the protein to be bound to a matrix. For 

20 example, glutathione-S-transferase/ AGS fusion proteins can be adsorbed onto 

glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized 
microtitre plates, which are then combined with the cell ly sates (e.g. 35 S-labeled) and 
the candidate compound, and the mixture incubated under conditions conducive to 
complex formation (e.g., at physiological conditions for salt and pH). Following 

25 incubation, the beads are washed to remove any unbound label, and the matrix 
immobilized and radiolabel determined directly, or in the supernatant after the 
complexes are dissociated. Alternatively, the complexes can be dissociated from the 
matrix, separated by SDS-PAGE, and the level of AGS-binding protein found in the 
bead fraction quantitated from the gel using standard electrophoretic techniques. 
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Other techniques for immobilizing proteins on matrices can also be used in the 
drug screenmg assays of the invention. For example, either AGS or its target molecule 
can be mobilized utilizing conjugation of biotin and streptavidin. Biotinylated AGS 
molecules can be prepared from biotin-NHS (N-hydroxy-succmmnde) using techniques 
5 well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL) and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical) 
Alternatively, antibodies reactive with AGS but which do not mterfere with binding of 
the protem to its target molecule can be derivatized to the wells of the plate, and AGS 
trapped in the wells by antibody conjugation. As described above, preparations of an 
10 AGS-bmdmg protein and a candidate compound are incubated m the AGS-presenting 
wells of the plate, and the amount of complex trapped in the well can be quantitated 
Methods for detecting such complexes, m addition to those described above for the 
GST-immobilized complexes, include immunodetection of complexes using antibodies 
reactive with the AGS target molecule, or which are reactive with AGS protein and 
5 compete with the target molecule; as well as enzyme-linked assays which rely on 
detecting an enzymatic activity associated with the target molecule. 

The invention also provides cell-based assays for identifying compounds which 
modulate AGS activity. In one embodiment, the cell-based assay involves a method for 
'denting a compound which modulates (e.g., stimulates or inhibits) AGS activity 
mcludmg contacting a cell which contains an AGS protein with a test compound and 
determming the ability of the test compound to modulate the activity of the AGS protein 
In a preferred embodiment, determining the ability of the test compound to modulate 
AGS acuvrty includes determining the ability of the test compound to effect (e g. 
upregulate or down regulate) G-protein mediated signaling in the cell. For example G- 
protem mediated signaling can be determined in an AGS-containing cell prior to 
contacting the cell with a test compound and compared to the signaling after contacting 
the cell with a test compound. Compounds which reduce the G-protein mediated 
scaling can be identified as AGS antagonists whereas compounds which increase the 
G-protein mediated signaling can be identified as antagonists. 
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Alternatively, modulators of AGS expression can be identified in a method 
wherein a cell is contacted with a candidate compound and the expression of AGS 
mRNA or protein in the cell is determined. The level of expression of AGS mRNA or 
protein in the presence of the candidate compound is compared to the level of expression 

5 of AGS mRNA or protein in the absence of the candidate compound. The candidate 
compound can then be identified as a modulator of AGS nucleic acid expression based 
on this comparison. For example, when expression of AGS mRNA or protein is greater 
(statistically significantly greater) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as a stimulator of AGS mRNA or protein 

10 expression. Alternatively, when expression of AGS mRNA or protein is less 

(statistically significantly less) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as an inhibitor of AGS mRNA or protein 
expression. The level of AGS mRNA or protein expression in the cells can be 
determined by methods described herein for detecting AGS mRNA or protein. 

15 The present invention also provides cell-based assays for identifying compounds 

which modulate the activity of an AGS. Typically, cells for use in cell based assays are 
engineered to express a heterologous AGS protein. In preferred embodiments, the cells 
may be further engineered such that the endogenous AGS protein is not expressed in 
functional form. Cells used to express a heterologous AGS can be, for example, 

20 mammalian or yeast in origin. In preferred embodiments, the engineered cells of the 
present invention are yeast cells. 

In certain embodiments, the cells for use in the instant assays are further 
engineered to express a heterologous G protein coupled receptor which is functionally 
integrated into the signaling pathway of the cell in which it is expressed. In the case of 

25 yeast cells, heterologous GPCRs can be expressed in yeast cells and can be made to 
couple to yeast G proteins resulting in the transduction of signals via the endogenous 
yeast pheromone system signaling pathway which is normally activated by STE2 or 
STE3. In certain embodiments, such heterologous receptors can be made to couple more 
effectively to the yeast pheromone system signaling pathway by coexpressing a 

30 heterologous G protein subunit or subunits, by expressing a chimeric G protein subunit, 
or by expressing a chimeric G protein coupled receptor. Methods for preparing 
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engineered yeast cells, and engineered yeast cells themselves, are described in U S 
Patent No. 5,482,835 by King et al. and PCX Publication WO 94/23025 by Fowlkes et 
al.. the contents of both of which are hereby expressly incorporated herein by reference. 
In one embodiment of the assay, the effect of a test compound on AGS induced 
5 G protein activation can be measured by detecting changes in second messenger 

generation. Alternatively, in another embodiment, the effect of a test compound on an 
.nhtbitor of an activator of G-protein signaling can be detected. A variety of intracellular 
effectors have been identified as being G-protem-regulated, including adenylyl cyclase, 
cyclic GMP, phosphodiesterases, phosphoinositidase C, and phospholipase A 2 . In 

0 addition, G proteins interact with a range of ion channels and are able to inhibit certain 
volta ge -sens,tive Ca++ transients, as well as stimulating cardiac K+ channels. For 
example, in yeast ceils a reduction in the generation of AGS-induced second messengers 
or mating factor responses (e.g., growth arrest or shmoo formation) could be measured 

In one embodiment, GTP binding or GTPase enzymatic activity of G proteins 

1 can be measured in plasma membrane preparations by determining, respectively the 
incorporation of GTp/'s or breakdown of y"p GTP using techniques that are known in 
the art (For example, see Signal Transduction: A Practical Approach. G. Milligan Ed 
Oxford University Press, Oxford England). When receptors that modulate cAMP Jre 
tested, it will be possible to use standard techniques for cAMP detection, such as 
competitive assays which quantitate [3 H ]cAMP in the presence of unlabelled cAMP. 

Certain receptors stimulate the activity of phospholipase C which stimulates the 
breakdown of phosphatidylinositol 4,5, bisphosphate to 1,4,5-IP3 (which mobilizes 
intracellular Ca-H-) and diacylglycerol (DAG) (which activates protein kinase C). 
Inositol lipids can be extracted and analyzed using standard lipid extraction techniques. 
DAG can also be measured using thin-layer chromatography. Water soluble derivatives 
of all three inositol lipids (IP,, I P2 , I P3) can also be quantitated using radiolabelling 
techniques or HPLC. 

The mobilization of intracellular calcium or the influx of calcium from outside 
the cell can be measured using standard techniques. The choice of the appropriate 
calcium indicator, fluorescent, bioluminescent, metallochromic, or C a++ -sensitive 
micro-electrodes depends on the cell type and the magnitude and time constant of the 
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event under study (Borle (1990) Environ Health Perspect 84:45-56). As an exemplary 
method of Ca++ detection, cells could be loaded with the Ca++ sensitive fluorescent dye 
fura-2 or indo-1, using standard methods, and any change in Ca++ measured using a 
fluorometer. 

5 The other product of PIP2 breakdown, DAG can also be produced from 

phosphatidyl choline. The breakdown of this phospholipid in response to receptor- 
mediated signaling can also be measured using a variety of radiolabelling techniques. 

The activation of phospholipase A2 can easily be quantitated using known 
techniques, including, for example, the generation of arachadonate in the cell. 

10 In the case of certain receptors, it may be desirable to screen for changes in 

cellular phosphorylation. Such assay formats may be useful when the receptor of 
interest is a receptor tyrosine kinase. For example, yeast transformed with the FGF 
receptor and a ligand which binds the FGF receptor could be screened using colony 
immunoblotting (Lyons and Nelson (1984) Proc. Natl Acad Sci. USA 81:7426-7430) 

15 using anti-phosphotyrosine. In addition, tests for phosphorylation could be useful when 
a receptor which may not itself be a tyrosine kinase, activates protein kinases that 
function downstream in the signal transduction pathway. Likewise, it is noted that 
protein phosphorylation also plays a critical role in cascades that serve to amplify signals 
generated at the receptor. Multi-kinase cascades allow not only signal amplification but 

20 also signal divergence to multiple effectors that are often cell-type specific, allowing a 
growth factor to stimulate mitosis of one cell and differentiation of another. 

One such cascade is the MAP kinase pathway that appears to mediate both 
mitogenic, differentiation and stress responses in different cell types. Stimulation of 
growth factor receptors results in Ras activation followed by the sequential activation of 

25 c-Raf, MEK, and p44 and p42 MAP kinases (ERK1 and ERK2). Activated MAP kinase 
then phosphorylates many key regulatory proteins, including p90RSK and Elk-1 that are 
phosphorylated when MAP kinase translocates to the nucleus. Homologous pathways 
exist in mammalian and yeast cells. For instance, an essential part of the S. cerevisiae 
pheromone signaling pathway is comprised of a protein kinase cascade composed of the 

30 products of the STEU, STE7, and FUS3/KSS1 genes (the latter pair are distinct and 

functionally redundant). Accordingly, phosphorylation and/or activation of members of 
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this kinase cascade can be detected and used to quantitate receptor engagement. 
Phosphotyrosine specific antibodies are available to measure increases in tyrosine 
phosphorylation and phospho-specific antibodies are commercially available (New 
England Biolabs, Beverly, MA). 

5 Modified methods for detecting receptor-mediated signal transduction exist and 

one of skill in the art will recognize suitable methods that may be used to substitute for 
the example methods listed. 

In another embodiment, an indicator gene can be used for detection. In one 
embodiment an indicator gene is an unmodified endogenous gene. For example, in yeast 
10 cells, the instant method can rely on detecting the transcriptional level of such 

pheromone system pathway responsive endogenous genes as the BAR J or FUSJ FUS 2 
mating factor, STE3 STE13, KEX1, STE2, STE6, STE7, SST2, or CHS1. (Appletauer and' 
Zchstetter. 1989. Eur. J. Biochem. 181:243) 

In other embodiments, the sensitivity of an endogenous indicator gene can be 
1 5 enhanced by manipulating the promoter sequence at the natural locus for the indicator 
gene. Such manipulation may range from point mutations to the endogenous regulatory 
elements to gross replacement of all or substantial portions of the regulatory elements. 

For example, in the case of the BARJ gene, the promoter of the gene can be 
modified to enhance the transcription of BARJ upon activation of the yeast pheromone 
>-0 system pathway. BARJ gene transcription is activated upon exposure of yeast cells to 
mating factor. The sequence of the BARJ gene is known in the art (see e.g., U.S. patent 
4,613,572). Moreover, the sequences required for a-factor-enhanced expression of the 
Barl , and other pheromone responsive genes have been identified. (Appeltauer and 
Achstetter 1989. Eur. J. Biochem. 181:243; Hagenetal. 1991. Mol. Cell. Biol. 
5 1 1 :2952). In an exemplary embodiment, the yeast Bar J promoter can be engineered by 
mutagenesis to be more responsive, e.g., to more strongly promote gene transcription, 
upon stimulation of the yeast pheromone pathway. Standard techniques for 
mutagenizing the promoter can be used. In such embodiments, it is desirable that the 
conserved oligonucleotide motif described by Appeltaure et al. be conserved. 
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In yet other embodiments, rather than measuring second messenger production or 
alterations in transcription, the activity of endogenous yeast proteins can be assayed. For 
example, in one embodiment, the signal transduction pathway of the receptor 
upregulates expression or otherwise activates an enzyme which is capable of modifying 

5 a substrate which can be added to the cell. The signal can be detected by using a 
detectable substrate, in which case loss of the substrate signal is monitored, or 
alternatively, by using a substrate which produces a detectable product. In certain 
embodiments, the substrate is naturally occurring. Alternatively, the substrate can be 
non-naturally occurring. In preferred embodiments, BAR1 activity can be measured. 

10 In other embodiments, the modulation of a receptor by a test compound can 

result in a change in the transcription of a gene, which is not normally pheromone 
responsive. In preferred embodiments, the gene is easily detectable. For example, in a 
preferred embodiment, the subject assay can be used to measure Pho5, a secreted acid 
phosphatase. Acid phosphatase activity can be measured using standard techniques. 

15 In other embodiments, reporter gene constructs can be used. Reporter gene 

constructs are prepared by operatively linking a reporter gene with at least one 
transcriptional regulatory element. If only one transcriptional regulatory element is 
included it must be a regulatable promoter. At least one of the selected transcriptional 
regulatory elements must be indirectly or directly regulated by the activity of the 

20 selected cell-surface receptor whereby activity of the receptor can be monitored via 
transcription of the reporter genes. 

Many reporter genes and transcriptional regulatory elements are known to those 
of skill in the art and others may be identified or synthesized by methods known to those 
of skill in the art. Reporter genes include any gene that expresses a detectable gene 

25 product, which may be RNA or protein. Preferred reporter genes are those that are 
readily detectable. The reporter gene may also be included in the construct in the form 
of a fusion gene with a gene that includes desired transcriptional regulatory sequences or 
exhibits other desirable properties. 

Examples of reporter genes include, but are not limited to CAT (chloramphenicol 

30 acetyl transferase) (Alton and Vapnek (1 979), Nature 282: 864-869) luciferase, and other 
enzyme detection systems, such as beta-galactosidase; firefly luciferase (deWet et al. 
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(1987), Mol. Cell. Biol. 7:725-737); bacterial luciferase (Engebrecht and Silverman 
(1984), PNAS 1: 4154-4158; Baldwin et al. (1984), Biochemistry 23: 3663-3667); 
alkaline phosphatase (Toh et al. (1989) Eur. J. Biochem. 182: 231-238, Hall et al. (1983) 
J. Mol. Appl. Gen. 2: 101), human placental secreted alkaline phosphatase (Cullen and 
5 Malim (1 992) Methods in Enzymol. 216:362-368) and green fluorescent protein (U.S. 
patent 5,491,084; W096/23898). 

Transcriptional control elements include, but are not limited to, promoters, 
enhancers, and repressor and activator binding sites. Suitable transcriptional regulatory 
elements may be derived from the transcriptional regulatory regions of genes whose 
expression is rapidly induced, generally within minutes, of contact between the cell 
surface protein and the effector protein that modulates the activity of the cell surface 
protein. Examples of such genes include, but are not limited to, the immediate early 
genes (see, Sheng et al. (1990) Neuron 4: 477-485), such as c-fos. Immediate early 
genes are genes that are rapidly induced upon binding of a ligand to a cell surface 
protein. The transcriptional control elements that are preferred for use in the gene 
constructs include transcriptional control elements from immediate early genes, elements 
derived from other genes that exhibit some or all of the characteristics of the immediate 
early genes, or synthetic elements that are constructed such that genes in operative 
linkage therewith exhibit such characteristics. The characteristics of preferred genes 
from which the transcriptional control elements are derived include, but are not limited 
to, low or undetectable expression in quiescent cells, rapid induction at the 
transcriptional level within minutes of extracellular simulation, induction that is transient 
and independent of new protein synthesis, subsequent shut-off of transcription requires 
new protein synthesis, and mRNAs transcribed from these genes have a short half-life. 
It is not necessary for all of these properties to be present. 

Other promoters and transcriptional control elements, in addition to those 
described above, include the vasoactive intestinal peptide (VIP) gene promoter (cAMP 
responsive; Fink et al. (1988), Proc. Natl. Acad. Sci. 85:6662-6666); the somatostatin 
gene promoter (cAMP responsive; Montminy et al. (1986), Proc. Natl. Acad. Sci. 
8.3:6682-6686); the proenkephalin promoter (responsive to cAMP, nicotinic agonists, 
and phorbol esters; Comb et al. (1986), Nature 323:353-356); the phosphoenolpyruvate 
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carboxy-kinase gene promoter (cAMP responsive; Short et al. (1986), J. Biol. Chem. 
261:9721-9726); the NGFI-A gene promoter (responsive to NGF, cAMP, and serum; 
Changelian et al (1989). Proc. Natl. Acad. Sci. 86:377-381); and others that may be 
known to or prepared by those of skill in the art. 

5 In certain assays it may be desirable to use changes in growth in the screening 

procedure. For example, one of the consequences of activation of the pheromone signal 
pathway in wild-type yeast is growth arrest. If one is testing for an antagonist of a G 
protein-coupled receptor, this normal response of growth arrest can be used to select 
cells in which the pheromone response pathway is inhibited. That is, cells exposed to 

1 o both a known agonist and a peptide of unknown activity will be growth arrested if the 
peptide is neutral or an agonist, but will grow normally if the peptide is an antagonist. 
Thus, the growth arrest response can be used to advantage to discover peptides that 
function as antagonists. 

However, when searching for compounds which can function as modulators of 

1 5 pheromone response pathways, the growth arrest consequent to activation of the 

pheromone response pathway is an undesirable effect since cells that bind agonists stop 
growing while surrounding cells that fail to bind agonists will continue to grow. The 
cells of interest, then, will be overgrown or their detection obscured by the background 
cells, confounding identification of the cells of interest. To overcome this problem the 

20 present invention teaches engineering the cell such that: 1) growth arrest does not occur 
as a result of exogenous signal pathway activation (e.g., by inactivating the FAR1 gene); 
and/or 2) a selective growth advantage is conferred by activating the pathway (e.g., by 
transforming an auxotrophic mutant with a HIS3 gene under the control of a pheromone- 
responsive promoter, and applying selective conditions). 

25 Alternatively, the promoter may be one which is repressed by the receptor 

pathway, thereby preventing expression of a product which is deleterious to the cell. 
With a receptor repressed promoter, one screens for agonists by linking the promoter to a 
deleterious gene, and for antagonists, by linking it to a beneficial gene. Repression may 
be achieved by operably linking a receptor- induced promoter to a gene encoding mRNA 

30 which is antisense to at least a portion of the mRNA encoded by the marker gene 

(whether in the coding or flanking regions), so as to inhibit translation of that mRNA. 
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Repression may also be obtained by linking a receptor-induced promoter to a gene 
encoding a DNA binding repressor protein, and incorporating a suitable operator site 
into the promoter or other suitable region of the marker gene. 

In the case of yeast, exemplary positively selectable (beneficial) genes include 
S the following: URA3, LYS2, HISS, LEU2, TRP1; ADE1, 2,3,4,5, 7,8; ARGl, 3, 4, 5, 6, 8; 
HIS1, 4, 5; ILV1, 2, 5; THRJ, 4; TRP2, 3, 4, 5; LEW, 4; MET2,3,4,8,9,14,16 19 ' ' 
URA1,2,4,5,10; H0M3.6; ASP3; CHOI; ARO 2, 7; CYS3; OLE1; IN01,2,4; PR 0U . 
Countless other genes are potential selective markers. The above are involved in 
well-characterized biosynthetic pathways. The imidazoleglycerol phosphate dehydratase 
0 (IGP dehydratase) gene (HIS3) is preferred because it is both quite sensitive and can be 
selected over a broad range of expression levels. In the simplest case, the cell is 
auxotrophic for histidine (requires histidine for growth) in the absence of activation. 
Activation leads to synthesis of the enzyme and the cell becomes prototrophic for 
histidine (does not require histidine). Thus the selection is for growth in the absence of 
* hisndine. Since only a few molecules per cell of IGP dehydratase are required for 
histidine prototrophy, the assay is very sensitive. 

In another embodiment a reporter gene, e.g., the fusNacZ reporter plasmid can 
be introduced along with the AGS. The addition of an antagonist should result in a 
decrease in 0-galactosidase units over that observed in the absence of the antagonist 
demonstrating the ability of the antagonist to interact in a negative fashion with the ' 
AGS. 

In another version of the assay, cells can be selected for resistance to 
aminotriazole (AT), a drug that inhibits the activity of IGP dehydratase. Cells with low 
fixed level of expression of HIS3 are sensitive to the drug, while cells with higher levels 
are resistant. The amount of AT can be selected to inhibit cells with a basal level of 
HIS3 expression (whatever that level is) but allow growth of cells with an induced level 
of expression. In this case selection is for growth in the absence of histidine and in the 
presence of a suitable level of AT. 

In appropriate assays, so-called counterselectable or negatively selectable genes 
may be used. Suitable genes include: URA3 (orotidine-5'-phosphate decarboxylase; 
inhibits growth on 5-fluoroorotic acid), LYS2 (2-aminoadi P ate reductase; inhibits ' 
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growth on a-aminoadipate as sole nitrogen source), CYH2 (encodes ribosomal protein 
L29; cycloheximide-sensitive allele is dominant to resistant allele), CAN1 (encodes 
arginine permease; null allele confers resistance to the arginine analog canavanine), and 
other recessive drug-resistant markers. CAN1 is one example of a preferred reporter 
5 gene. 

In one example, the reporter gene affects yeast cell growth. The natural response 
to signal transduction via the yeast pheromone system response pathway is for cells to 
undergo growth arrest. This is a preferred way to select for antagonists of a 
ligand/receptor pair that stimulates a the pathway. An antagonist would inhibit the 

10 activation of the pathway; hence, the cell would be able to grow. Thus, the FAR1 gene 
may be considered an endogenous counterselectable marker. The FAR1 gene is 
preferably inactivated when screening for agonist activity. 

The reporter gene may also be a screenable gene. The screened characteristic 
may be a change in cell morphology, metabolism or other screenable features. Suitable 

15 markers include beta-galactosidase (Xgal, C12FDG, Salmon-gal, Magenta-Gal (latter 
two from Biosynth Ag)), alkaline phosphatase, horseradish peroxidase, exo-glucanase 
(product of yeast exbl gene; nonessential, secreted); luciferase; bacterial green 
fluorescent protein; (human placental) secreted alkaline phosphatase (SEAP); and 
chloramphenicol transferase (CAT). Some of the above can be engineered so that they 

20 are secreted (although not p-galactosidase). A preferred screenable marker gene is 
beta-galactosidase; yeast cells expressing the enzyme convert the colorless substrate 
Xgal into a blue pigment. Again, the promoter may be receptor-induced or 
receptor-inhibited. 

In another example using yeast cells, a screen can take advantage of the fact that 
25 a gpalfusl -HIS3 colony expressing wild type Gas can grow upon replica plating to 
media lacking histidine and containing ImM 3-aminotriazole (AT). The growth of this 
strain occurs due to the partially constitutive state of the pheromone pathway, which 
leads to partial derepression of thefusl-HlS3 reporter gene. AT inhibits the activity of 
IGP dehydratase. Cells with low, fixed level of expression of HIS3 are sensitive to the 
30 drug, while cells with higher levels are resistant. The amount of AT can be selected to 
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inhibit cells with a basal level of HIS3 expression (whatever that level is) but allow 
growth of cells with an induced level of expression. A colony to which an antagonist of 
an AGS has been added will presumably fail to grow on this media due to the reduced 
signaling via the pheromone pathway. 
5 In certain embodiments, a peptide library can be expressed to test for modulators 

of an AGS. In one embodiment, the peptide library is expressed by the cell that also 
expresses the AGS, thereby creating an "autocrine" system, wherein the peptide to be 
tested for AGS-modulating activity is made by the same cell expressing the AGS 
protein. For a general description of yeast cell autocrine systems for screening peptide, 
' see PCT Publication WO 94/23025 by Fowlkes et al., the contents of which are 

expressly incorporated herein by reference. In certain embodiments in which yeast cells 
are used, such a library can be expressed using a leader sequence for periplasmic 
expression, e.g., a yeast mating factor leader sequence. Yeast cells are bounded by a 
lipid bilayer called the plasma membrane. Between this plasma membrane and the cell 
wall is the periplasmic space. Peptides secreted by yeast cells cross the plasma 
membrane through a variety of mechanisms and thereby enter the periplasmic space. 
The secreted peptides are then free to interact with other molecules that are present in the 
periplasm or displayed on the outer surface of the plasma membrane. The peptides then 
either undergo re-uptake into the cell, diffuse through the cell wall into the medium, or 
become degraded within the periplasmic space. 

The test polypeptide library may be secreted into the periplasm by any of a 
number of exemplary mechanisms, depending on the nature of the expression system to 
which they are linked. In one embodiment, the peptide may be structurally linked to a 
yeast signal sequence, such as that present in the a-factor precursor, which directs 
secretion through the endoplasmic reticulum and Golgi apparatus. 

An alternative mechanism for delivering peptides to the periplasmic space is to 
use the ATP-dependent transporters of the STE6/MDR1 class. This transport pathway 
and the signals that direct a protein or peptide to this pathway are not as well 
characterized as is the endoplasmic reticulum-based secretory pathway. Nonetheless, 
these transporters apparently can efficiently export certain peptides directly across the 
plasma membrane, without the peptides having to transit the ER/Golgi pathway. It is 
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anticipated that at least a subset of peptides can be secreted through this pathway by 
expressing the library in context of the a-factor prosequence and terminal tetrapeptide. 
Use of either of the described pathways is within the scope of the invention. 

The present invention does not require periplasmic secretion of peptides, or, if 

5 such secretion is provided, any particular secretion signal or transport pathway. In 
certain embodiments, peptides expressed with a signal sequence may bind to and 
modulate AGS molecules prior to their transport to the cell surface. 

In other embodiments, a heterologous G-protein coupled receptor can be 
coexpressed with a heterologous AGS protein. In embodiments in which yeast cells are 

10 used, it may be desirable to use the leader sequence of a yeast secreted protein to direct 
transport of G-protein coupled receptors to the plasma membrane. 

In certain embodiments, the compounds to be tested in the subject assays can be 
derived from libraries. While the use of libraries of peptides is well established in the 
art, new techniques have been developed which have allowed the production of mixtures 

15 of other compounds, such as benzodiazepines (Bunin et al. 1992. J. Am. Chem. Soc. 
1 14:10987; DeWitt et al. 1993. Proc. Natl. Acad. Sci. USA 90:6909) peptoids 
(Zuckermann. 1994. J. Med. Chem. 37:2678) oligocarbamates (Cho et al. 1993. 
Science. 261 :1303), and hydantoins (DeWitt et al. supra). Rebek et al. have described an 
approach for the synthesis of molecular libraries of small organic molecules with a 

20 diversity of 104-105 (Carell et al. 1994. Angew. Chem. Int. Ed. Engl. 33:2059; Carell et 
al. Angew. Chem. Int. Ed. Engl. 1994. 33:2061). 

The compounds of the present invention can be obtained using any of the 
numerous approaches in combinatorial library methods known in the art, including: 
biological libraries; spatially addressable parallel solid phase or solution phase libraries, 

25 synthetic library methods requiring deconvolution, the 'one-bead one-compound' library 
method, and synthetic library methods using affinity chromatography selection. The 
biological library approach is limited to peptide libraries, while the other four 
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries 
of compounds (Lam, K.S. Anticancer Drug Des. 1997. 12:145). 
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In one embodiment, the test compound is a peptide or peptidomimetic. In 
another, preferred embodiment, the compounds are small, organic non-peptidic 

compounds. 

Other exemplary methods for the synthesis of molecular libraries can be found in 
5 the art, for example in: Erbetal. 1994. Proc. Natl. Acad. Sci. USA 91:1 1422; Horwell 
etal. 1996 Immunopharmacology 33:68; and in Gallop etal. 1994. J. Med. Chem. 
37:1233. In addition, libraries such as those described in the commonly owned 
applications U.S.S.N. 08/864,241, U.S.S.N. 08/864,240 and U.S.S.N. 08/835,623 can 
be used to provide compounds for testing in the present invention. The contents of each 
0 of these applications is expressly incorporated herein by this reference. 

Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
Biotechniaues 13:412-421), or on beads (Lam (1 991) Nature 354:82-84), chip. (Fodor 
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP 
•409),plasmids (Cull etal. (1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage 
< (Scott and Smith (1990) Science 249:386-390); (Devlin (X990)Science 249-404-406)- 
(Cwirlaetal. (1990) Proc. Natl. Acad. Sci. 87:6378-6382); (Felici (1991) J Mo, Biol 
222:301-310); (Ladner supra.). 

In certain embodiments, the test compounds are exogenously added to the yeast 
cells expressing a recombinant receptor and compounds that modulate signal 
transduction via the receptor are selected. In other embodiments, the yeast cells express 
the compounds to be tested. For example, a culture of the subject yeast cells can be 
further modified to collectively express a peptide library as described in more detail in 
PCT Publication WO 94/23025 the contents of which is expressly incorporated herein by 
this reference. 

Other types of peptide libraries may also be expressed, see, for example, U S 
Patents 5,270,181 and 5,292,646; and PCT publication W094/ 02502). In still another 
embodiment, the combinatorial polypeptides are produced from a cDNA library. 

Exemplary compounds which can be screened for activity include, but are not 
limited to, peptides, nucleic acids, carbohydrates, small organic molecules, and natural 
product extract libraries. In such embodiments, both compounds which agonize or 
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antagonize the receptor- or channel-mediated signaling function can be selected and 
identified. 

In certain other embodiments, cells can be engineered to produce the compounds 
to be tested. This assay system has the advantage of increasing the effective 

5 concentration of the compound to be tested. In one embodiment, a method such as that 
described in WO 94/23025 can be utilized. 

Other methods can also be used. For example, peptide libraries are systems 
which simultaneously display, in a form which permits interaction with a target, a highly 
diverse and numerous collection of peptides. These peptides may be presented in 

1 0 solution (Houghten ( 1 992) Biotechniques 13:41 2-42 1 ), or on beads (Lam (1991) Nature 
354:82-84), chips (Fodor (1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), 
spores (Ladner USP '409), plasmids (Cull et al. (1992) Proc Natl Acad Sci USA 
89:1865-1869) or on phage (Scott and Smith (1990) Science 249:386-390); (Devlin 
(\990)Science 249:404-406); (Cwirla et al. (1990) Proc. Natl. Acad Set 87:6378- 

15 6382); (Felici (1991) J. Mol. Biol 222:301-310); (Ladner supra.). Many of these 

systems are limited in terms of the maximum length of the peptide or the composition of 
the peptide (e.g., Cys excluded). Steric factors, such as the proximity of a support, may 
interfere with binding. Usually, the screening is for binding in vitro to an artificially 
presented target, not for activation or inhibition of a cellular signal transduction pathway 

20 in a living cell. While a cell surface receptor may be used as a target, the screening will 
not reveal whether the binding of the peptide caused an allosteric change in the 
conformation of the receptor. 

The Ladner et al. patent, USSN 5,096,81 5, describes a method of identifying 
novel proteins or polypeptides with a desired DNA binding activity. Semi-random 

25 ("variegated") DNA encoding a large number of different potential binding proteins is 
introduced, in expressible form, into suitable yeast cells. The target DNA sequence is 
incorporated into a genetically engineered operon such that the binding of the protein or 
polypeptide will prevent expression of a gene product that is deleterious to the gene 
under selective conditions. Cells which survive the selective conditions are thus cells 

30 which express a protein which binds the target DNA. While it is taught that yeast cells 
may be used for testing, bacterial cells are preferred. The interactions between the 
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protein and the target DNA occur only in the cell (and then only in the nucleus), not in 
the periplasm or cytoplasm, and the target is a nucleic acid, and not a receptor protein. 
Substitution of random peptide sequences for functional domains in cellular proteins 
permits some determination of the specific sequence requirements for the 
5 accomplishment of function. Though the details of the recognition phenomena which 
operate in the localization of proteins within cells remain largely unknown, the 
constraints on sequence variation of mitochondrial targeting sequences and protein 
secretion signal sequences have been elucidated using random peptides (Lemire et al J. 
Biol. CW(1 989) 264, 20206 and Kaiser et al. (1 987) Science 235:3 12, respectively). 
» In certain embodiments of the instant invention, the compounds tested are in the 

form of peptides from a peptide library. The peptide library of the present invention 
takes the form of a cell culture, in which essentially each cell expresses one, and usually 
only one, peptide of the library. While the diversity of the library is maximized if each 
cell produces a peptide of a different sequence, it is usually prudent to construct the 
library so there is some redundancy. Depending on size, the combinatorial peptides of 
the library can be expressed as is, or can be incorporated into larger fusion proteins. The 
fusion protein can provide, for example, stability against degradation or denaturation as 
well as a secretion signal if secreted. In an exemplary embodiment of a library for 
intracellular expression, e.g., for use in conjunction with intracellular target receptors, 
the polypeptide library is expressed as thioredoxin fusion proteins (see, for example, ' 
U.S. Patents 5,270,181 and 5,292,646; and PCT publication W094/ 02502). The 
combinatorial peptide can be attached one the terminus of the thioredoxin protein, or, for 
short peptide libraries, inserted into the so-called active loop. 

In one embodiment, the peptide library is derived to express a combinatorial 
library of polypeptides which are not based on any known sequence, nor derived from 
cDNA. That is, the sequences of the library are largely random. In preferred 
embodiments, the combinatorial polypeptides are in the range of 3-100 amino acids in 

length, more preferably at least 5-50, and even more preferably at least 10, 13, 15, 20 or 
25 amino acid residues in length. Preferably, the polypeptides of the library are of 
uniform length. It will be understood that the length of the combinatorial peptide does 
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not reflect any extraneous sequences which may be present in order to facilitate 
expression, e.g., such as signal sequences or invariant portions of a fusion protein. 

In another embodiment, the peptide library is a combinatorial library of 
polypeptides which are based at least in part on a known polypeptide sequence or a 
portion thereof (not a cDNA library). That is, the sequences of the library is semi- 
random, being derived by combinatorial mutagenesis of a known sequence. See, for 
example, Ladner et al. PCT publication WO 90/02909; Garrard et al., PCT publication 
WO 92/09690; Marks et al. (1992) J. Biol Chem. 267:16007-16010; Griffths et al. 
(1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et 
al. (1992) PNAS 89:4457-4461 . Accordingly, polypeptide(s) which are known ligands 
for a target receptor can be mutagenized by standard techniques to derive a variegated 
library of polypeptide sequences which can further be screened for agonists and/or 
antagonists. 

In still another embodiment, the combinatorial polypeptides are produced from a 
cDNA library. 

In a preferred embodiment of the present invention, the cells collectively produce 
a "peptide library", preferably including at least 10 3 to 10 7 different peptides, so that 
diverse peptides may be simultaneously assayed for the ability to interact with the 
exogenous receptor. In an especially preferred embodiment, at least some peptides of 
the peptide library are secreted into the periplasm, where they may interact with the 
"extracellular" binding site(s) of an exogenous receptor. They thus mimic more closely 
the clinical interaction of drugs with cellular receptors. This embodiment optionally 
may be further improved (in assays not requiring pheromone secretion) by preventing 
pheromone secretion, and thereby avoiding competition between the peptide and the 
pheromone for signal peptidase and other components of the secretion system. 

In certain embodiments of the present invention, the peptides of the library are 
encoded by a mixture of DNA molecules of different sequence. Each peptide-encoding 
DNA molecule is ligated with a vector DNA molecule and the resulting recombinant 
DNA molecule is introduced into a yeast cell. Since it is a matter of chance which 
peptide encoding DNA molecule is introduced into a particular cell, it is not predictable 
which peptide that cell will produce. However, based on a knowledge of the manner in 
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peptide library for a specific peptide to interact with the receptor during transit through 
the secretory pathway. This has been postulated to occur in mammalian cells exhibiting 
autocrine activation. Such interaction could yield activation of the response pathway 
during transit, which would still allow identification of those cells expressing a peptide 

5 agonist. For situations in which peptide antagonists to externally applied receptor 

agonist are sought, this system would still be effective, since both the peptide antagonist 
and receptor would be delivered to the outside of the cell in concert. Thus, those cells 
producing an antagonist would be selectable, since the peptide antagonist would be 
properly and timely situated to prevent the receptor from being stimulated by the 

10 externally applied agonist. 

An alternative mechanism for delivering peptides to the periplasmic space is to 
use the ATP-dependent transporters of the STE6/MDR1 class. This transport pathway 
and the signals that direct a protein or peptide to this pathway are not as well 
characterized as is the endoplasmic reticulum-based secretory pathway. Nonetheless, 

15 these transporters apparently can efficiently export certain peptides directly across the 
plasma membrane, without the peptides having to transit the ER/Golgi pathway. It is 
anticipated that at least a subset of peptides can be secreted through this pathway by 
expressing the library in context of the a-factor prosequence and terminal tetrapeptide. 
The possible advantage of this system is that the receptor and peptide do not come into 

20 contact until both are delivered to the external surface of the cell. Thus, this system 
strictly mimics the situation of an agonist or antagonist that is normally delivered from 
outside the cell. Use of either of the described pathways is within the scope of the 
invention. The present invention does not require periplasmic secretion, or, if such 
secretion is provided, any particular secretion signal or transport pathway. 

25 After identifying certain test compounds as potential modulators of AGS 

proteins, the practitioner of the subject assay will continue to test the efficacy and 
specificity of the selected compounds both in vitro and in vivo. Whether for subsequent 
in vivo testing, or for administration to an animal as an approved drug, agents identified 
in the subject assay can be formulated in pharmaceutical preparations for in vivo 

30 administration to an animal, preferably a human. 
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the "bait" and the "prey" proteins are able to interact, in vivo, forming an AGS- 
dependent complex, the DNA-binding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene 
{e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to the 

5 transcription factor. Expression of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be isolated and used to obtain the 
cloned gene which encodes the protein which interacts with AGS. 

Another aspect of the invention is directed to a yeast-based counter-screen for 
pheromone pathway inhibitors. This screen is based on a yeast cell constitutively 

1 o expressing a receptor-independent activator (e.g. , AGS 1 ) of the yeast pheromone 
response pathway such that a lethal marker gene is expressed (e.g., Canl). Then, a 
cDNA library of interest, cloned into an appropriate vector (e.g., pYES2), is introduced 
into this strain to identify cDNAs that, when expressed, counteract the activators ability 
to induce the expression of the lethal marker gene. This screen, therefore, can rapidly 

15 identify proteins that both directly and indirectly regulate activators of the pheromone 
response pathway. 

Modulators of AGS protein activity and/or AGS nucleic acid expression 
identified according to these drug screening assays can be used to treat, for example, 
diseases or disorders characterized by excessive or insufficient G-protein mediated 

20 signal transduction. Examples of such diseases or disorders which can be treated using 
modulators of AGS protein activity and/or nucleic acid expression include proliferative 
disorders and/or diseases. Support for the use of AGS modulators in treating 
proliferative disorders can be found in the fact that, for example, oncogenic mutations in 
ras-like G proteins have been implicated in approximately 30% of human cancers, 

25 including 90% of pancreatic and 50% of colon cancers (Bos (1988) Mutat. Res. 195:255- 
271; Bos (1989) Cancer Res. 49:4682-4689). In addition, defects in GPCR-mediated 
signaling have been associated with various disease states (Landis et al. (1989) Nature 
340:692-696 (pituitary tumors); Lyons et al. (1990) Science 249:655-659 (human 
endocrine tumors); Allen et al. (1991) Proc. Natl. Acad ScL USA 88:1 1354-1 1358 

30 (neoplastic transformation and atherosclerosis); and Farfel et al. (1 996) J. Biol. Chem. 
271:19653-19655 (pseudohypoparathyroidism)). As AGS is both a member of the ras 
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ftmily and an activator of heterotrimeric G-protein mediated signal transduction 
—tors of AGS function are likely to have pharmaceutical applications. Methods of 
treatment include the steps of administer^ the AGS mo.ecules of the present invention 
or modulators of AGS protein activity and/or nucleic acid expression, , g. in a 
5 pharmaceutical composition as described above, to a subject in need of such treatment. 

b * Detection an d Diag nostic Assays- 

The invention further provides a method for detecting the presence of AGS in a 
b.ological sample. The method involves contacting the biological sample with a 
0 compound or an agent capable of detecting AGS protein or mRNA such that the 
presence of AGS is detected in the biological sample. A preferred agent for detecting 
AGS mRNA is a labeled or labelable nucleic acid probe capable of hybridizing to AGS 
mRNA. The nucleic acid probe can be, for example, the AGS cDNA of SEQ ID NO- 1 
or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 ' 
> nucleotides in length and sufficient to specifically hybridize under stringent conditions 
to AGS mRNA. A preferred agent for detecting AGS protein is a labeled or labelable 
anttbody capable of binding to AGS protein. Antibodies can be po.yclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab') 2 ) 
can be used. The term "labeled or labelable", with regard to the probe or antibody is 
intended to encompass direct labeling of the probe or antibody by coupling (e g. 
Physically linking) a detectable substance to the probe or ant.body, as well as indirect 
labehng of the probe or antibody by reactivity with another reagent that is dzrectly 
labeled. Examples of indirect labeling include detection of a primary antibody using a 
fluorescently labeled secondary antibody and end-labeling of a DNA probe with biotin 
such that tt can be detected with fluorescently labeled streptavidin. The term "biological 
sample" is intended to include tissues, cells and biological fluids isolated from a subject 
as well as tissues, cells and fluids present within a subject. That is, the detection method 
of the mvention can be used to detect AGS mRNA or protein in a biological sample ,„ 
vitro as well as in vivo . For example, in vitro techniques for detection of AGS mRNA 
■nclude Northern hybridizations and in situ hybridizations. /„ yitro techniques for 
detects of AGS protein include enzyme linked immunosorbent assays (ELISAs) 
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Western blots, immunoprecipitations and immunofluorescence. Alternatively, AGS 
protein can be detected in vivo in a subject by introducing into the subject a labeled anti- 
AGS antibody. For example, the antibody can be labeled with a radioactive marker 
whose presence and location in a subject can be detected by standard imaging 
5 techniques. 

In one preferred embodiment of the detection method, the biological sample is a 
cell sample, a tissue section, for example, a freeze-dried or fresh frozen section of tissue 
removed from a patient, or a biological fluid obtained from a subject. 

The invention also encompasses kits for detecting the presence of AGS in a 

10 biological sample. For example, the kit can comprise a labeled or labelable compound 
or agent capable of detecting AGS protein or mRNA in a biological sample; means for 
determining the amount of AGS in the sample; and means for comparing the amount of 
AGS in the sample with a standard. The compound or agent can be packaged in a 
suitable container. The kit can further comprise instructions for using the kit to detect 

15 AGS mRNA or protein. 

c. Methods of Treatment : 

Additional methods of the invention include methods for treating a subject 
having a disorder characterized by aberrant AGS activity or expression. These methods 

20 include administering to the subject an AGS modulator such that treatment of the subject 
occurs. The terms "treating" or "treatment" , as used herein, refer to reduction or 
alleviation of at least one adverse effect or symptom of a disease or disorder, e.g., a 
disease or disorder characterized by or associated with abnormal or aberrant AGS 
protein activity or AGS nucleic acid expression. 

25 As used herein, an AGS modulator is a molecule which can modulate AGS 

nucleic acid expression and/or AGS protein activity. For example, an AGS modulator 
can modulate, e.g., upregulate (activate) or downregulate (suppress), AGS nucleic acid 
expression. In another example, an AGS modulator can modulate (e.g., stimulate or 
inhibit) AGS protein activity. If it is desirable to treat a disease or disorder characterized 

30 by (or associated with) aberrant or abnormal (non-wild-type) AGS nucleic acid 

expression and/or AGS protein activity by inhibiting AGS nucleic acid expression, an 



SUBSTITUTE SHEET (RULE 26) 



WO 99/58670 

PCT/US99/1015I 

-68- 

AOS mediator can be an ^ mo|ecu|ei e g a tanm ^ teriw ^ 
Examples of antisense motales ^ cm be osed w jnhjb . ^ ^ 
express™ i„Cu de „, smse motales wh . ch ^ complemeniary M a port . on 
undated region of SEQ ,D NO : 3, which also includes the star. codo „ Md ,^ 

~ - COn,P ' eme "^ • 3 POTiM ° f 3 ' — of SEQ ,D 
NO: . AN AGS modulator which inhibits AGS nucleic acid expression can a ls „ be a 
small molecule or other drug, „. s , a small module or drug identified using me 
screemng assays described herein, which mhibtts AGS nucleic acid expression ,f i, is 

abnormal (non-wild-type, AGS nucleic acid express™ ^ AGS protem 
snmulatrng AGS nucleic acid expression, an AGS m„d»,a«o, can be. for example a 
nuc,e,c acid module encoding AGS («.» , a nucieic acid m „|ec u le comprising . ' 
nucleot.de sequence homologous to me nucieotide sequence of SEQ ,D NO:,) or a small 
molecule or other dtug, , s ., a small molectle (peptide, or thug identified us,„g the 
screening assays described herein, which s,imu,a,=s AGS nucleic acid expression 

Ahernatively, if i, is desirable ,o treat a disease or disorder character,^ by („, 
— ^^aberrantor^orm^nou-wild-typeJAGSnuc^ic acid expression 
and/or AGS protein activity by inhibiting AGS protein activity, an AGS modulator can 

drug tdenntled usmg the screening assays described herein, which inhibits AGS protein 
acvtty ,f it is desirable to trea, a disease or disorder characterized by ,„r associated 
wtth) aberrant „ r abnormal (non-wild-type, AGS nucleic acid expression and/or AGS 
^otein activity by s,imu,ah„g AGS protetn activity, an AGS modulator can he » active 
AGS protem or portion thereof (, s „ an AGS protem „ r portion thereof having an at™, 
acd sequence which ,s homologous to the amino acid sequence of SEQ ,D N0 2) or a 
small molecule or other drug, a small molecule or drug identified using the 
screemng assays described herein, which stimulates AGS protein activtty 

In another embodiment, an AGS modulator ,s an indirect modulator (e g . an 
.ndtreaactiva^rormhihtorofAGS protein expresston and/or acuvity. Forexample ., 
may be possible ,„ modulate the expression of an AGS protein by targeting a 
.™scripho„ factor which activates or represses AGS gene transcription. Accordingly 
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an AGS modulator is an indirect modulator which increases the expression and/or 
activity of an AGS-specific transcription factor. In another embodiment, an AGS 
modulator is an indirect modulator which decreases the expression and/or activity of an 
AGS-specific transcription factor. 
5 Other aspects of the invention pertain to methods for modulating a cell associated 

activity. These methods include contacting the cell with an agent (or a composition 
which includes an effective amount of an agent) which modulates AGS activity or AGS 
expression such that a cell associated activity is altered relative to a cell associated 
activity of the cell in the absence of the agent. As used herein, "a cell associated 
10 activity" refers to a normal or abnormal activity or function of a cell. Examples of cell 
associated activities include proliferation, migration, differentiation, production or 
secretion of molecules, such as proteins, and cell survival. In a preferred embodiment, 
the cell-associated activity is mediated by a G-protein signaling pathway. The term 
"altered" or "modulated" as used herein refers to a change, e.g., an increase or decrease, 
15 of a cell associated activity. In one embodiment, the agent stimulates AGS protein 
activity or AGS nucleic acid expression. Examples of such stimulatory agents include 
an active AGS protein, a nucleic acid molecule encoding AGS that has been introduced 
into the cell, and a modulatory agent which stimulates AGS protein activity or AGS 
nucleic acid expression and which is identified using the drug screening assays described 
20 herein. In another embodiment, the agent inhibits AGS protein activity or AGS nucleic 
acid expression. Examples of such inhibitory agents include an antisense AGS nucleic 
acid molecule, an anti- AGS antibody, and a modulatory agent which inhibits AGS 
protein activity or AGS nucleic acid expression and which is identified using the drug 
screening assays described herein. These modulatory methods can be performed in vitro 
25 (e.g. , by culturing the cell with the agent) or, alternatively, in vivo (e.g. , by administering 
the agent to a subject). In a preferred embodiment, the modulatory methods are 
performed in vivo, e.g., the cell is present within a subject, e.g., a mammal, e.g., a 
human, and the subject has a disorder or disease characterized by or associated with 
abnormal or aberrant AGS activity or expression. 
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A nucle.c acid molecule, a protein, an AGS modulator etc. used in the methods 
of treatment can be incorporated into an appropriate pharmaceutical composition 
descnbed herem and administered to the subject through a route which allows the 
-lecule, protein, modulator etc. to perform its intended function. Examples of routes 
ofadministrationarealsodescribedherei, Particular AGS inhibitory agents and AGS 
stimulatory agents are described further below. 
Inhibitory Agents 

According ,o a mediatory msth od of the inve „ tion , AGS activity is inhibited jn 
a ce by conning the ce „ wilh ^ mbiMy agM InUbkoiT agaB ^ 

can be, for example, intraceihfar binding moltcu|es , hM M „ inhibji fc 
.c,,v,„ of AGS. As used herein, te ttrm ^ ^ . 

mc ude mo.ecules *« « "—riy ,„ inhibit fc express or rf , 

byb.„d,„ gt „ lhepr o t ei„i Be ,f, toanudeic ^ (esmmraAmoiecuie)te ^ 

«hc protem or ,o a target with which «he protein indirecUy interacts. Examples of 
. «~U*r bindmg molecules, descnbed in fcnher detail b,,„w, include polypeptides 
that directly or indirectly bind an AGS moiecule or target molecule, antisense AGS 
nuc eic acid molecules ( ,,, ,„ inhibit ttamla , m of AGS mRNA), insula, anti- 
AG asdics (,,., „ innibit ^ ^ of AGS proie . n) ^ chemj£a| inhita 
the AGS protein. 

In one embodiment, an inhibitor of an AGS or AGS-related molecule is 
•dented by modifying one of the above-mentioned assays to activate a reporter gene 
thatpreventsgrowthC,,.,^). Thus, candidate inhibitors of an AGS or AGS-related 
activator that can bloc k theAGS induction ofthe growth inhibiting gene can be rapidly 

the screening of anucleic acid library oracompound library. Accordingly in one 

embodiment, an identified inhibitor of an AGS molecule is the rp^ , 

m Mn- ™ u- , ■ , molecule is the RGS5 polypeptide (SEQ 

ID NO. 25) which etther directly or indirectly down-regulates AGS activity 

In another embodiment, an inhibitory agent ofthe invention is an antisense 

sa.d gene, or a recombinant expression vector encoding said antisense nucleic acid 
molecule. The use of antisense nucleic acids to downregulate the expression of a 
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particular protein in a cell is well known in the art (see e.g., Weintraub, H. et al. , 
Antisense RNA as a molecular tool for genetic analysis, Reviews - Trends in Genetics, 
Vol 1(1) 1986; Askari, F.K. and McDonnell, W.M. (1996) N. Eng. J. Med. 334:316- 
318; Bennett, M.R. and Schwartz, S.M. (1995) Circulation 92:1981-1993; Mercola, D. 
5 and Cohen, J.S. (1995) Cancer Gene Ther. 2:47-59; Rossi, J.J. (1995) Br. Med Bull. 
51:217-225; Wagner, R.W. (1994) Nature 372:333-335).e.g.e.g. Preferably, an antisense 
nucleic acid is designed so as to be complementary to a region preceding or spanning the 
initiation codon on the coding strand or in the 3' untranslated region of an mRNA. An 
antisense nucleic acid for inhibiting the expression of AGS protein in a cell can be 
10 designed based upon the nucleotide sequence encoding the AGS protein (e.g., SEQ ID 
NO: 1), constructed according to the rules of Watson and Crick base pairing (e.g., as 
described above in subsection I). 

An antisense nucleic acid can exist in a variety of different forms. For example, 
the antisense nucleic acid can be an oligonucleotide that is complementary to only a 
15 portion of an AGS gene. An antisense oligonucleotides can be constructed using 

chemical synthesis procedures known in the art.e.g. To inhibit AGS expression in cells 
in culture, one or more antisense oligonucleotides can be added to cells in culture media, 
typically at about 200 fig oligonucleotide/ml. 

Alternatively, an antisense nucleic acid can be produced biologically using an 
20 expression vector into which a nucleic acid has been subcloned in an antisense 

orientation (e.g. , nucleic acid transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of interesting. 

In another embodiment, an antisense nucleic acid for use as an inhibitory agent is 
a ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which 
25 are capable of cleaving a single-stranded nucleic acid s such as an mRNA, to which they 
have a complementary region (for reviews on ribozymes see e.g., Ohkawa, J. et al. 
(1995) J. Biochem. H8:251-258; Sigurdsson, S.T. and Eckstein, F. (1995) Trends 
Biotechnol 13:286-289; Rossi, JJ. (1995) Trends Biotechnol. 13:301-306; Kiehntopf, 
M. et al (1995) J. Mot. Med. 73:65-71). A ribozyme having specificity for AGS mRNA 
30 can be designed based upon the nucleotide sequence of the AGS cDNA. For example, a 
derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the base 
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hybridoma-derived monoclonal antibody or a recombinant antibody from a 
combinatorial library), DNAs encoding the light and heavy chains of the monoclonal 
antibody are isolated by standard molecular biology techniques. For hybridoma derived 
antibodies, light and heavy chain cDNAs can be obtained, for example, by PCR 

5 amplification or cDNA library screening. For recombinant antibodies, such as from a 
phage display library, cDNA encoding the light and heavy chains can be recovered from 
the display package (e.g., phage) isolated during the library screening process. 
Nucleotide sequences of antibody light and heavy chain genes from which PCR primers 
or cDNA library probes can be prepared are known in the art. For example, many such 

10 sequences are disclosed in Kabat, E.A., et al (1991) Sequences of Proteins of 

Immunological Interest, Fifth Edition, U.S. Department of Health and Human Services, 
NIH Publication No. 91-3242 and in the "Vbase" human germline sequence database. 

Once obtained, the antibody light and heavy chain sequences are cloned into a 
recombinant expression vector using standard methods. To allow for cytoplasmic 

15 expression of the light and heavy chains, the nucleotide sequences encoding the 
hydrophobic leaders of the light and heavy chains are removed. An intracellular 
antibody expression vector can encode an intracellular antibody in one of several 
different forms. For example, in one embodiment, the vector encodes full-length 
antibody light and heavy chains such that a full-length antibody is expressed 

20 intracellularly. In another embodiment, the vector encodes a full-length light chain but 
only the VH/CH1 region of the heavy chain such that a Fab fragment is expressed 
intracellularly. In the most preferred embodiment, the vector encodes a single chain 
antibody (scFv) wherein the variable regions of the light and heavy chains are linked by 
a flexible peptide linker (e.g., (Gly4Ser)3) and expressed as a single chain molecule. To 

25 inhibit AGS activity in a cell, the expression vector encoding the anti-AGS intracellular 
antibody is introduced into the cell by standard transfection methods, as discussed 
hereinbefore. 

Other inhibitory agents that can be used to inhibit the activity of an AGS protein 
are chemical compounds that directly inhibit AGS activity or inhibit the interaction 
30 between AGS and target molecules. Such compounds can be identified using screening 
assays that select for such compounds, as described in detail above. 
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Stimulatory Agents 
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For practicing the modulatory method in vivo in a subject, the modulatory agent 
can be administered to the subject such that AGS activity in cells of the subject is 
modulated. The term "subject" is intended to include living organisms in which an 
AGS-dependent cellular response can be elicited. Preferred subjects are mammals. 
5 Examples of subjects include humans, monkeys, dogs, cats, mice, rats, cows, horses, 
goats and sheep. 

For stimulatory or inhibitory agents that comprise nucleic acids (including 
recombinant expression vectors encoding AGS protein, antisense RNA and intracellular 
antibodies), the agents can be introduced into cells of the subject using methods known 

10 in the art for introducing nucleic acid (e.g. , DNA) into cells in vivo. Examples of such 
methods encompass both non- viral and viral methods, including: 

Direct Injection: Naked DNA can be introduced into cells in vivo by directly 
injecting the DNA into the cells (see e.g., Acsadi et al (1991) Nature 332:815-818; 
Wolff et al (1990) Science 247:1465-1468). For example, a delivery apparatus (e.g., a 

15 "gene gun") for injecting DNA into cells in vivo can be used. Such an apparatus is 
commercially available (e.g., from BioRad). 

Cationic Lipids: Naked DNA can be introduced into cells in vivo by complexing 
the DNA with cationic lipids or encapsulating the DNA in cationic liposomes. 
Examples of suitable cationic lipid formulations include N-[-l-(2,3- 

20 dioleoyloxy)propyl]N,N,N-triethylammonium chloride (DOTMA) and a 1 : 1 molar ratio 
of l,2-dimyristyloxy-propyl-3-dimethylhydroxyethylammonium bromide (DMRIE) and 
dioleoyl phosphatidylethanolamine (DOPE) (see e.g., Logan, J.J. et al. (1995) Gene 
Therapy 2:38-49; San, H. et al. (1993) Human Gene Therapy 4:781-788). 

Receptor-Mediated DNA Uptake: Naked DNA can also be introduced into cells 

25 in vivo by complexing the DNA to a cation, such as polylysine, which is coupled to a 
ligand for a cell-surface receptor (see for example Wu, G. and Wu, C.H. (1 988) J. Biol 
Chem. 263:14621; Wilson et al (1992) J. Biol Chem. 267:963-967; and U.S. Patent No. 
5,1 66,320). Binding of the DNA-ligand complex to the receptor facilitates uptake of the 
DNA by receptor-mediated endocytosis. A DNA-ligand complex linked to adenovirus 

30 capsids which naturally disrupt endosomes, thereby releasing material into the 

cytoplasm can be used to avoid degradation of the complex by intracellular lysosomes 
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Adenoviruses: The genome of an adenovirus can be manipulated such that it 
encodes and expresses a gene product of interest but is inactivated in terms of its ability 
to replicate in a normal lytic viral life cycle. See for example Berkner et al (1988) 
BioTechniques 6:616; Rosenfeld et al (1991) Science 252:43 1-434; and Rosenfeld et al 

5 (1992) Cell 68:143-155. Suitable adenoviral vectors derived from the adenovirus strain 
Ad type 5 dl324 or other strains of adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are well known 
to those skilled in the art. Recombinant adenoviruses are advantageous in that they do 
not require dividing cells to be effective gene delivery vehicles and can be used to infect 
a wide variety of cell types, including airway epithelium (Rosenfeld et al (1992) cited 

10 supra), endothelial cells (Lemarchand et al (1992) Proc. Natl Acad. Sci. USA 89:6482- 
6486), hepatocytes (Herz and Gerard (1993) Proc. Natl Acad ScL USA 90:2812-2816) 
and muscle cells (Quantin et al (1992) Proc. Natl Acad ScL USA 89:2581-2584). 
Additionally, introduced adenoviral DNA (and foreign DNA contained therein) is not 
integrated into the genome of a host cell but remains episomal, thereby avoiding 

1 5 potential problems that can occur as a result of insertional mutagenesis in situations 
where introduced DNA becomes integrated into the host genome (e.g., retroviral DNA). 
Moreover, the carrying capacity of the adenoviral genome for foreign DNA is large (up 
to 8 kilobases) relative to other gene delivery vectors (Berkner et al cited supra; Haj- 
Ahmand and Graham (1986) J. Virol 57:267). Most replication-defective adenoviral 

20 vectors currently in use are deleted for all or parts of the viral El and E3 genes but retain 
as much as 80 % of the adenoviral genetic material. 

Adeno-Associated Viruses: Adeno-associated virus (AAV) is a naturally 
occurring defective virus that requires another virus, such as an adenovirus or a herpes 
virus, as a helper virus for efficient replication and a productive life cycle. (For a review 

25 see Muzyczka et al. Curr. Topics in Micro, and Immunol (1992) 158:97-129). It is also 
one of the few viruses that may integrate its DNA into non-dividing cells, and exhibits a 
high frequency of stable integration (see for example Flotte et al (1992) Am. J. Respir. 
Cell Mol Biol 7:349-356; Samulski et al (1989) J. Virol 63:3822-3828; and 
McLaughlin et al (1989) J. Virol 62:1963-1973). Vectors containing as little as 300 

30 base pairs of AAV can be packaged and can integrate. Space for exogenous DNA is 
limited to about 4.5 kb. An AAV vector such as that described in Tratschin et al (1985) 
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This invention is farther illustrated by the following examples which should not 
be construed as limiting. 



The following experimental procedures were used in the Examples: 

Plasmids and yeast strains - All DNA manipulations were done using standard 
recombinant DNA techniques (see e.g., Sambrook et al. (1989) Molecular Cloning: A 

10 Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY; 
and Ausubel et al. (1994) Current Protocols in Molecular Biology, John Wiley & Sons, 
NY); using commercially available enzymes (New England Biolabs, BRL, United States 
Biochemical Corporation). Transformations of bacterial strain DH10B were performed 
by electroporation at 1 .8 kV in a GenePulser II (BioRad). Transformations of yeast were 

15 performed as described in Ito et al. (1983) J. BacterioL 153:163-168; and Elble, R. 
(1992) Biotechniques 13:18-19. 

Yeast strains used in this study are listed below in Table 1 . 



EXAMPLES 



TABLE 1: 



20 



YEAST STRAIN 



GENOTYPE 



25 



CY1316 



CY4600 



CY 12444 



MATagpalA farlA tbtl-1 fuslp-HIS3 
can! stel4::trpl::LYS2 ste3A lys2 ura3 
leu2 trpl his3 

MATagpalA ste4A farlA tbtl-1 
fuslp-HIS3 canl ste!4::trpl ::LYS2 
ste3A lys2 ura3 leu2 trpl his3 
MATagpalA steSA farlA tbtl-1 
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CY12970 



CY1141 



CY7967 



CY8342 



CY9603 



CY9571 



PCT/US99/I0I51 

fuslp-HIS3 canl stel4::trpl ;:LYS2 
ste3A lys2 ura3 leu2 trpl his3 
MATagpalA ste20A farlAfuslp- 
H1S3 canl stel4::trpl;:LYS2 ste3A 
lys2 ura3 leu2 trpl his3 

MATctGPAl ( }_ 41) -Gcd2 far J A 
tbtl-1 fuslp-HIS3 canl 

stel4::trpl::LYS2 ste3A lys2 ura3 
leu2 trpl his3 

MATaGPAl (1 . 4]) .Gcd3 farlA 
tbtl-1 fuslp-HIS3 canl 

stel4::trpl::LYS2 ste3A fys2 ura3 
leu2 trpl his3 

MATafrat Gas] far I A tbtl-1 
fuslp-fflS3 canl stel4::trpl. ;LYS2 
ste3A lys2 ura3 leu2 trpl his3 
MATa GPAl(]_ 41) -Gal6(S270P) 
fori A tbtl-1 fuslp-HIS3 canl 

stel4::trpl::LYS2 ste3A lys2 ura3 
leu2 trpl his3 

MATa GPA1 farlA tbtl-1 fuslp- 
HIS3 canl stel4::trpl.:LYS2 ste3A 
lys2 ura3 leu2 trpl his3 
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CY13 16 (MATa gpalAfarA tbtl-1 fusl-HIS3 canl stel4::trpl::LYS2 ste3A lys2 
ura3 leu2 trpl his3\ the parent of all strains used in this study, was obtained by standard 
genetic techniques, with SY1390 (Stevenson et al. (1992) Genes Dev. 6:1293-1304) 
(provided by G. Sprague), and SMI 188 (Sapperstein et al. (1994) Mol Cell Biol 

5 14:1 438-1 449 ) (provided by S. Michaelis) serving as the original sources of tiiftfiisl- 
HIS3 and steN alleles, respectively. Unless otherwise indicated, all genomic disruptions 
were made with the URA3 gene, followed by selection on 5'-fluoroorotic acid (Boeke et 
al. (1 987) Methods. Em. 154: 164-195). Got genomic integrations were made at the 
GPA1 locus and verified by Southern, Got expression, and phenotypic analysis. Plasmid 

10 pR15 (Beals et al. (1987) Proc. Natl Acad. Sci. USA 84:7886-7890), carrying the coding 
region of human Gai2. Plasmid CP1 127, carrying the promoter sequences and first 41 
amino acid codons of GPA J, was prepared by ligation of a sequence encompassing 
nucleotides -200 to +100 ofGPAl (where translational start is +1) to pRS405 (Sikorski 
and Hieter (1989) Genetics 122:19). Plasmid CP1 183, carrying the GPA1 (]-4i)-Gai2 

1 5 chimera sequence, was made by PCR amplification of the Gcti2 coding region 

encompassing amino acids 36 to its stop codon at position 357 using the oligo pair SEQ 
ID NO: 4 and SEQ ID NO: 5 and using plasmid pRl 5 as template. The amplified 
product was digested with Sad and Sail, then ligated into SacUSaR digested CP1 1 27. A 
glycine to alanine alteration at codon 204 of Gcti2 in CP1 183 was introduced using 

20 Stratagene's QuickChange kit and mutagenic oligos SEQ ID NO: 6 and SEQ ID NO: 7 
creating plasmid CP5533. Sequences encoding (3-galactosidase (lacZ) were introduced 
downstream of the fusl promoter on plasmid pRS424 (Sikorski and Hieter (1989) 
Genetics 122:19) to create CP 1584. Plasmid pSMl 87, with a 4.3 kb DNA fragment 
carrying the STEN gene flanked by BamHl sites, was kindly provided by S. Michaelis. 

25 This flamHI fragment was inserted into BamHl digested and shrimp alkaline 

phosphatase treated pRS415 and pRS414 (Sikorski and Hieter (1989) Genetics 122:19) 
to create, respectively, plasmids CPS 108 and CP5336. 

The yeast phosphogly cerate kinase (PGK1) promoter sequence was amplified 
from yeast genomic DNA using the oligonucleotide pair SEQ ID NOs: 26 and 27, 

30 digested with BgRl and Ncol and ligated into Esp3VNcol digested Yep51Nco (Broach 
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et al. (1983) /„ Experimental Manipulation of Gene Expression (M. Inouye ed ) 
Academic Press, NY). A 0.95 Kb fragment encompassing this promoter was then 
excsed with B gl ll and BamHl for ligation into pRS415 and P RS424 (Sikorski and 
H.ter (1989) Genetics 122:19). The coding sequence for human RGS4 was amplified 
5 by reverse-transcriptase PGR from human brain poIy-A selected RNA and ligated 

downstream of the PGK1 promoter in pRS424. RGS4 sequences were then amplified by 
PCR usmg the oligonucleotide pair SEQ ID NOS:_28 and 29, digested with BamHl and 
Xhol, and ligated into BamHVXhol digested P YES2 (Invitrogen). FUS1 promoter 
sequences were amplified from yeast genomic DNA usmg the oligonucleotide pair 
° SE Q I DNOS:30and31,d 1 gestedwithA>«Iand&/I,and g elpunfied. The CAN1 
codmg reg ]0n was amplified from yeast genomic DNA using the oligonucleot.de pair 
SEQ ID NOS: 32 and 33, digested with Atol and Pstl, and gel purified. Both purified 
fragments were ligated to PstllKpnl digested P RS4 14 (Sikorski et al. (1989) Genetics 
122: 1 9-27) to create plasmid 2440. The coding sequence for human AGS1 (see below) 
was amphfied by PCR using the oligonucleotide pair SEQ ID NOS- 34 and 35 
digested with Bspm and Sail, and ligated into a NcoVSali digested derivative of P RS415 
carrymg the ^promoter, creating plasmid 1451-AGS1. The Stratagene 
QuikChange kit and protocol were used to introduce a glycine to valine alteration at 
codon 31 of AGS 1 in P YES2 using the mutagenic oligonucleotides of SEQ ID NOS- 36 
and 37, as well as a cysteine to serine alteration at codon 278 of AGS1 using the 
mutagenic oligonucleotides of SEQ ID NOS: 38 and 39. 

A customized adult human liver cDNA library, ligated into the yeast vector 
PYES2 at BstXl and EcoRl, was constructed by Invitrogen. This library was size 
selected at 1250 bp and contains approximately 1 .77 x 1 0 7 primary recombinants 

The vector pYEX4Ti, which allows for copper inducible expression in yeast of 
sequences fused at the N-terminus with GST, was purchased from ^ ^ 
Vector pGEX-KG, which allows for IPTG inducible expression in bacteria of sequences 
fused at the N-terminus with GST (Guan and Dixon (1991), Anal. Biochem 192 262- 
267). Sl milar vectors are also commercially available. The vectors P cDNA3. IHisC and 
pBlueBacHis2A, which allow for expression of N-terminal hexahistidine fusion 
protems, were purchased from Invitrogen. The vector pIND which allows for inducible 
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expression from mammalian cells, was purchased from Invitrogen. The cDNA for gene 
AGS was amplified from pYES2-AGS by PCR using the oligo pair provided in SEQ ID 
NOS: 8 and 9, cleaved with BamHl and EcoM , and ligated into BamHVEcoRl digested 
pYEX4Ti, pGEX-KG, pcDNA3.1HisC, pBlueBacHis2A and pIND. The coding 
5 sequence for Cdc42 was amplified by PCR from yeast genomic DNA using the oligo 
pair provided in SEQ ID NOS: 1 0 and 1 1, cleaved with BamHl and £coRI, and ligated 
into BamHUEcom digested pGEX-KG. Single glycine to valine alterations at codons 31 
and 36, and a glycine to alanine alteration at codon 81, were constructed using the 
Stratagene QuickChange kit and protocol and the respective mutagenic oligo pairs 
10 provided in SEQ ID NOS: 12 and 13, Hand 15, and 16 and 17. 

Additional amino acid substitutions, using appropriately designed oligonucleotides and 
standard techniques, were also introduced into AGS1 at the following residue positions 
(in brackets): serine to glycine (33); serine to alanine (80); asparagine to glycine (82); 
phenylalanine to asparagine (44); serine to glycine (33) and serine to alanine (80); serine 
15 to glycine (33) and asparagine to glycine (82); serine to alanine (80) and asparagine to 
glycine (82); serine to glycine (33), serine to alanine (80), and asparagine to glycine 
(82); serine to glycine (33), serine to alanine (80), asparagine to glycine (82), and 
glycine to alanine (81); lysine to glutamic acid (225) and lysine to glutamic acid (226); 
and, cysteine to serine (278). The plasmid pcDNA3.1HisC-AGS served as a template 
20 for mutagenesis. Following mutagenesis, AGS sequences were excised with Bam HI 
and Eco RI for subcloning into vectors described above. Bam HI and Eco RI sites were 
introduced via PCR, for example the Bam HI site was introduced using the primer set 
forth as SEQ ID NO:8 and the Eco RI site, introduced using the primer set forth as SEQ 
ID NO:9. Automated dideoxy sequencing was used to verify the correct construction of 
25 all plasmids used in the examples. 

Yeast screens - All yeast strains were grown in synthetic medium (0.67% yeast 
nitrogen base without amino acids supplemented with amino acids, adenine, uracil and 
2% carbon source as indicated). For the activator screens, cultures of CY1316/CP1 183 
were grown in glucose-supplemented medium lacking tryptophan to a cell density of 
30 approximately 2 x 10 7 /ml prior to transformation with lithium acetate. Approximately 
20 ug of an adult human liver cDNA library in vector pYES2 was used to transform 1 x 
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1 0 9 cells. Wormation mixw „ s were ^ ^ ^ ^ 

. W <opha„ and uracil and grown „ 30° for 20-22 hr. Replicas of 5ach pla , e „ m ^ 
made onto galactose medium lacking , OTtophan , uradI md md _ 

mM AT. Rephca plates were incubaKd a , 30 „ for ?2 ^ ^ cotata 
ontosnc^piateMaCdng^pnan^^..^^^ cd|s fcm ^ 
were .suspended i„ « water and eoua, nmite of ^ (apprMimaK|y 
spotted „»,o 3 p, a ,es: sucrose medium lacking trvpto^ ^ uraci|; ^ ^ 
acktng tryptophan, uracil Md hMdine 2jj d OTaidng , mM AT; ^ gaiMose 
l*kmg tryptophan, ^ md histidine ^ _ ^ at p|aKs ^ ^ 

» ; 0 -^ 8ta '^^a g a l ac 1 „se.dep e „de„ t g r owt h phe„o Iy pewe r ,c„ llur ed i „ 
hqu.d glucose m ed,„ m lacking ttyptopna, and uracil for plasmid recovery. 

For the inhibitor screen, initial transformations of CY1 141/2440/145I-AGS1 
wc rep e rf o m ,ed M d.scr ib edaboveford,eac,iva,o,sc reen . R e pl i casofeachpiattwere 
made onto selective galactose media* lacking umcil and arginine and containing 200 

Mg/mic^ne. Replica piates were incnba«a,30. for 72 hrattd growing coIonies 
patched onto selective sncros, plates lacking umci, for recovery. Cells from each patch 
were upended in sterile water and spotted as described above onto selective sucrose 
med,um lacking tnacil, omo ^ ^ „ ^ ^ ^ 

and argmme and containing 200 ug/ml canavartinc. Pla.es were grown a , 3 „. fo , 24 ta 
to replica-plated onto idennca, medium and allowed to grow for an additional 24 hr to 
d,.,nguis„ differentia, growth o„ ga,ac,„se. ^ wilh . 

phenotype were cultu^d in li q „id glucose medium lacking u™, for plasmid recovery 
*a,ns CV, Mn „ ^ CY , 14I/ , 45I . AGS1/2440 ^ ^ ^ a sM J' 

fashton on selective glucose medium lacking histidine and containing 2 mM AT or on 
selecve glucose medium Jacking argntine and containing 200 „g/ ml ca mi vanine 
Plasmtds were isolated from yeas, essentially as described in Strathem and 
Hrggins (1 991, Em . ^ , 9.329, exeept for Me omission rf fc 

punftctton s K p, attd transformed by electropotation into DHI0B for amp,if,caho„ and 
re- Wormation into yeas,. Isotaed plasmids were digest with Bs,Xl and EcRl and 
elecuophoresedon m agarose gels foranalysis of inserts. Fresh cultures of yeas, 
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strains CY1316/CP1 183 or CY1 141/2440/1451-AGS1 were transformed with each 
isolated plasmid, plated onto sucrose medium lacking tryptophan and uracil, and grown 
at 30° for 20-22 hr. Replica plating and spotting assays were then performed as 
described above to identify plasmids that conferred a galactose-dependent growth 
5 phenotype on all transformants. Sequence of inserts was determined by automated 
dideoxy sequencing using primers homologous to pYES2 vector sequences in the T7 
promoter region and the CYC1 terminator region, as well as primers internal to the insert 
sequence. 

Epistasis tests - Epistasis analysis on yeast strains was performed by introducing 

10 the pYES2 vector alone or AGS carried on the pYES2 vector into the indicated strains. 
Strains were grown to saturation in liquid sensitive sucrose medium. Approximately 
2000 cells from each transformant were spotted onto selective sucrose medium (to 
determine viability) and selective glucose and galactose medium lacking histidine and 
containing ImM AT. Plates were incubated at 30° for 2 days prior to photographing. 

15 B-galactosidase assays - Indicated yeast strains carrying a plasmid-bome FUS1- 

lacZ reporter construct were grown for 16-18 hours in selective glucose or galactose 
medium. Cells were harvested in log phase. Aliquots of cells (100 jil) were placed in 
96-well microtiter plates and 20^1 of 6X CPRG-Z buffer (360 mM Na 2 HP0 4 , 240 mM 
NaH 2 P0 4 , 60mM KC1, 6mM MgS0 4 , 2.5% Triton X-100, 16ul/ml p-mercaptoethanol, 

20 1 OmM (chlorophenol red-(J-D-galactopyranoside (CPRG) was added. Plates were 
incubated at 37° and reactions stopped with 60^1 1M sodium carbonate. Plates were 
read at 575 nm (using a Beckman Biomek® plate reader) and lacZ activity determined 
using the equation 1000 x A575/ (incubation time in min) (OD600 of original culture. 
All assays were done in triplicate on three independent isolates of each strain, and 

25 average activity +/- SEM was calculated. Data were analyzed using GraphPad Prism® 
software (Version 2.01). 

Sequence Alignments - Sequence alignments were performed using the 
CLUSTAL X multiple sequence alignment program (Version 1.0). CLUSTAL X is a 
windows-based version of CLUSTAL V (Higgins et al. (1991) CABIOS 8:189-191), 

30 modified as described in Thompson et al. (1994) Nuc. Acids Res. 22:4673-4680. 
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7 "-Panned usi„ 8 BLASTN , 

' * gap °P en > 5 > gap extension, 2: x dronoff <n- - 
wordsize, 11; and filter on ,• • X - dro P off ' 50, expect, 10.0; 

on. ' ' Xpect ' ,0 0 ' wor dsize, 3; and, filter, 

^^^^ The fin, , 

'0 flowed by two rounder pYEX4T -AGS wlt hBam HI and Eco RI, 

y two rounds of agarose gel purification (Qiagen) This fi*„ 
radiolabelled with a-f 32 P1 a gment Was 

» f -^^a n « unM r;i;r" , ' oM r pfcT,ssueNonheraBte 

h ^ f0rres,a »» '° *' andbtaic Neomycin (0 5 ^ Sd , 

POUNA3.1-HHC) was momtored in Neomycin 
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resistant clones by RNA blot analysis using the fall length cDNA coding sequence of 
AGS as probe (labeled with ct-[ 32 P]-deoxycytidine triphosphate as described above). 

Cultures of Spodoptera frugiperda (Sf9) were transfected with recombinant 
pBlueBacHisA-AGS. AGS-positive transfectants were screened for the production of p- 

5 galactosidase and AGS-positive phage were amplified according to the protocols 
provided by Invitrogen. 

In vitro transcription/translation assay - Protein expression from the original 
pYES2-AGS isolate and the pcDNA3. 1-HisC-AGS construct was determined by 
coupled in vitro transcription/translation reactions using the kit, protocol, and control 

10 plasmid from Clontech. Protein was labeled with S -methionine by standard 
techniques and final products electrophoresed on a 12% SDS-polyacrylamide gel. 
Following electrophoresis, protein was electroblotted onto a PVDF membrane; the blot 
was washed extensively with water, air dried, and exposed without screens to Fuji X-ray 
film for 24 hr at ambient temperature. 

1 5 Analysis of AGS protein expression levels - Expression levels of AGS tagged at 

the N-terminus with either GST or the His6 epitope were determined. For expression in 
Saccharomyces cerevisiae strain CY1 141 and E. coli strain BL21, AGS was introduced 
with an N-terminal GST fusion under control of the copper-inducible cupl promoter on 
plasmid pYEX4Ti-AGS (yeast) or an IPTG-inducible P tac promoter on plasmid pGEX- 

20 KG-AGS (bacteria). Plasmid CP5336, carrying an episomal copy of STE1 4, was also 
introduced into the yeast strains. For copper induction, cells were grown at 30° to a 
density of 5 x 10 6 cells/ml and treated with 0.1 mM CuS04 for 6 hr. Cells were 
harvested at a density of 1-2 x 10 7 /ml, and cell extracts were prepared by bead lysis in 
the presence of denaturation buffer (240 mM Tris-HCl, pH 6.8, 20% glycerol, 2% (w/v) 

25 SDS, 0.5M P-mercaptoethanol), followed by denaturation at 1 00° for 5 min. For IPTG 
induction, cells were grown at 30° in 2xYT supplemented with 100 ug/ml ampicillin to 
an initial optical density at 600 nm of 0.5, then treated with 0.5 mM IPTG for 3 hr. Cells 
were harvested by centrifugation and whole cell extracts prepared by denaturation at 100 
° for 5 min in denaturation buffer. 
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For transient expression in insect cells, recombinant phage carrying the 
pBlueBacHisA-AGS construct were used to infect cultures of Sf9 cells 2-3 d post 
mfect.on, cells were harvested by centrifugal and whole cell extracts prepared by 
denaturation at 100° for 5 min in denaturation buffer. Stable transfectants of HEK293 
5 cells carrying P cDNA3, -HisC-AGS were cultured to -80% confluency in DMEM a,d 
harvested by scraping in the presence of PBS. Cells were lysed at 4° m 5 mM Tris-HCl 
PH 7.4, 5 mM EDTA, 5 mM EGTA containing a protease inhibitor cocktail (Boehringer' 
Mannheim Complete™) by mechanical disn, P tion using a glass Dounce tissue grinder 
(Wheaton,Mil,vill e) NJ). Following a low-speed centniugation (3000 x g , 10min 4 o } 
10 to remove unlysed cells, supematants were centrifuged at ,00,000 x g , 30 min 4° to 
separate membrane and soluble fraction, Membrane pe.lets were resuspended by brief 
somcanon in lysis buffer and protein fractions denatured at 1 00° for 5 min in 
denaturation buffer. 

(.970,^327:6 8 0^ S 5 )a „ deWbloIK<iclIlt()PVDFniernl)raiies 
we re Hocked in 5% dry mi, k (w/v) „ ras . T md ^ ^ ^ ^ 

anttser™ raised agalnst 0ST ta Schlstos<lm , japonicum (signia)> a ] 2ooo 
monociona, a„ ti . RGS „ is6 ^ , ^ ^ ^ ^ 

4t * - an ' iSeram ( '" Vto ^>' » " ' *X» dilution of a„,i-Xprcss antiserum 
(Invnrogen). Blots were washed ex.er.ivdy with TBS-T and probed with a , ,0000 
ditaon of horsed peroxidase-conjugated donkey anti-rabb,. or anti-monse 
anuserum (Ametsham, f „, , „, ., jj, Afler washmg ^ ^ ^ ^ 
■rang the chemiluminescent substrate and protocol from Pierce. 

fracons from mammalian cells, (, g ., CHO-K, transiently transfected (72 hr post 
<mto»0 were scraped from , 50 mm dishes, washed twice i„ PBS, and cel. pellets 
lysed by 10-20 stmlces of a Dounce homogenize, in 5 mM Tris, pH 7.4, 5 mM EDTA 5 
-M EOTA, protease tnhibitor cocktail, a, 4- Lysates were centred a, ,00,000 x g for 
30 mtn a, 4- C, supernatant fractions removed, and etude membrane pellets resuspended 
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by Dounce homogenization in 50 mM Tris, pH 7.4, 0.6 mM EDTA, 5 mM MgCl 2 , and 
protease inhibitors were added prior to immunoblot analysis. 

Antisera Specific for AGS1 Polyclonal antisera specific for the AGS1 protein 
were raised in rabbit against the peptide sequence DTKSCLKNKTKENVD (SEQ. ID 
5 NO. 42) (amino acids 123 through 137 of AGS 1) through Cocalico Biologicals®. These 
antisera were determined to recognize AGS1 protein expressed in bacteria and 
mammalian cells by standard immunoblot analysis. 

Immunoblot Analysis - Yeast cell extracts, other cell extracts, and membrane 
fractions were immunoblotted using standard techniques and (the chemiluminescent 
10 substrate and protocol from Pierce. 

Analysis of Gpal Q-41] -Gai2 expression levels - Yeast strains CY1 3 1 6/CP1 1 83 
and CY1 3 1 6/CP5533 containing either pYES2 or pYES2-AGS were grown for 20 hr at 
30° in medium containing 2% galactose and lacking tryptophan and uracil to a final 
OD600 of approximately 1 . Cells were harvested by centrifugation, washed once with 
15 H2O, and lysed at 4° by mechanical disruption using 0.5 mm glass beads in 5 mM Tris- 
HC1, pH 7.4, 5 mM EDTA, 5 mM EGTA containing a protease inhibitor cocktail 
(Boehringer Mannheim Complete™). Following a low-speed centrifugation (3000 x g, 4 
0 C, 10 min) to remove cell debris, samples were centrifuged at 100,000 x g, 4° C, 30 
min to pellet crude membrane. Membrane pellets were resuspended in lysis buffer by 
20 brief sonication and samples were denatured, electrophoresed on 1 1 % SDS- 

polyacrylamide gels and analyzed by immunoblotting as described above. Antiserum 
specific for the C-terminal region of Gai2 (DuPont NEN) was used at a dilution of 
1:1000. 

Purification of tagged AGS for biochemical analysis - GST-AGS and GST- 
25 Cdc42 were purified from IPTG-induced E. coli cells carrying pGEX-KG-AGS or 

pGEX-KG-CDC42 using standard techniques. GST, GST-AGS, and GST-Cdc42 were 
purified from copper-induced yeast cells carrying pYEX-4Tl, pYEX-4Tl-AGS s or 
P YEX-4T1-Cdc42. 
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interactio^ - GST and an N-terminal GST fusion of AGS1 were expressed 
in yeast from the CUP1 promoter on p, asmid pYEX . 4T1 (Annad) ^ bound ^ 
glutath 10 ne sepharose matrix (Pharmada). After extensive washing, aliquots of matrix 
wuh bound GST fusion proteins were mixed with 100 ul extracts from Sf9 cells 
5 expressing either His 6 -Gai2 or the Ste4/His 6 -Stel8 dimer for 1 hour at 4° Extracts 
from His 6 -Goci2 expressing cells were preincubated with either ImM GDP or l mM OTP 
TS for 45 minutes at 25° C prior to incubation with GST affinity matrices. Samples were 
washed extensively and bound protein eluted at 1 00° C for 5 minutes in denaturation 
buffer. Equivalent amounts of eluted protein were analyzed by immunoblotting using a 
-0 : 00 dilution of anti-Xpress antisera (Invitrogen; recognized His 6 fus.n ep.tope) or a 
1:1000 dilution of anti-Ste4 antisera. Bound antibody was detected by 
chemiluminescence (Pierce). 

Ki-seAs^avs - The P 42/p44 MAP kinase assays were performed using the 
enzyme assay system (Amersham Life Science BIOTRAK® code RPN 84) essential.y as 
.5 according to the manufacture, Briefly, cells (,,., EcR-CHO-Kl, obtamed from 
Invttrogen) were transiently transfected with pIND/LacZ or pIND/AGS 1 DNA 
constats, EcR-3T3 (obtained from Invitrogen) cells transiently transfected with 
PIND/LacZ or pIND/AGSl DNA constructs, or CHO-K, cells stab.y expressing the 
human Nociceptin receptor (CADUS clone #14), were induced to produce LacZ or 
20 AGSlexpression using with 5 uM Ponasterone A for 5 and ,6hours. Cells were either 
pretreated with Pertusis toxin (50 ng/ml for 1 8 hours.) and/or stimulated w lt h serum 
(3%) or nociceptin (1 00 nM). 

After stimulation cells were washed, lysed, exposed to Magne SI um f?] ATP ( in 
appropriate buffers), and incubated for 30 min at 37° C. Reactions were then stopped 
with a stop reagent (10 ul), centrifuge for 15 sec at 14,000 x g, aliquoted (30,1) on to 
bmdmg paper, washed, and counts were recorded as a measure of kinase activity 

^H^^ - AGS1 were induced from cells 

carrymg, respectively, plasmid pYEX4Tl or P YEX4T1-AGS1 with 0.2 mM CuSO at 
30° for 6 hr. Induced cells were harvested by centrifugation, washed in PBS, and lysed 
by vortexing with glass beads (0.5 mm; Biospec) in lysis buffer (50 mM Tris-HC. P H 
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7.6, 50 mM NaCl, 0.5% Triton X-100, 1 mM DTT, 5 mM MgS0 4 , 1 \xl/ml Antifoam A 
(Sigma®)). All buffers used were at 4° and contained protease inhibitors (Boehringer 
Mannheim Complete™, EDTA-free). GST and GST lysates were centrifiiged at 10,000 
x g at 4°C for 1 5 min and supernatants mixed batch-wise on a rotating platform with 

5 glutathione sepharose (Pharmacia) for 2 hr at 4° C. Samples were washed with 20 
column volumes 50 mM Tris-HCl, pH 7.6, 50 mM NaCl, 0.01% Thesit, 5 mM MgS0 4J 
and eluted with 3 column volumes 50 mM Tris-HCl, pH 8.8, 150 mM NaCl, 5 mM 
MgS0 4s 0.01% Thesit, 20 mM reduced glutathione. The His 6 -Gai2 and Ste4/His 6 -Stel8 
dimer were purified from infected Sf9 cells after lysis in 1 10 mM Tris, pH 8.0, 100 mM 

10 NaCl, 5 mM MgCl 2 , 1 mM ethylene glycol-bis(p-aminoethyl ether) N,N,N\N'- 

tetraacetic acid (EGTA), 0.5% Triton-XlOO, 10 ^iM GDP by gentle rocking at 4° for 30 
min, followed by centrifiigation at 10,000 rpm for 10 min. Lysates were bound batch- 
wise to a nickel-agarose matrix (ProBond; Invitrogen®) for 1 hr at 4°. Samples were 
washed sequentially with 10 column volumes lysis buffer, 10 column volumes low pH 

15 buffer (20 mM NaP0 4 , pH 6.0, 500 mM NaCl, 5 mM MgCl 2 , 10 jiM GDP, 0.01% 

Thesit, 50 mM imidazole), 10 column volumes neutralization buffer (50 mM Tris-HCl, 
pH 7.0, 250 mM NaCl, 5 mM MgCl 2 , 0.01% Thesit), and eluted with 3 column volumes 
neutralization buffer containing 300 mM imidazole. 

All purified proteins were dialyzed against 50 mM Tris-HCl, pH 7.6, 150 mM 

20 NaCl, 5 mM MgS0 4 , 0.01% Thesit, 1 mM DTT and quick-frozen in dry ice for storage. 
Protein purity was monitored by gel electrophoresis and immunoblotting as described 
herein. Ga protein concentration was determined by saturation binding with excess 
radiolabeled GTPaS; GST fusion protein concentrations were determined by a dye- 
binding assay (BioRad®). 

25 Protein Association Assays - Purified Gai2 and Gpy (0.3 yM) were mixed with 

either purified GST or GST- AGS 1 (1.5 \M) in 100 |il 50 mM Tris, pH 7.6, 150 mM 
NaCl, 5 mM MgCl 2 , 0.6 mM EGTA, 0.01% Thesit for 45 min at 25°. GDi2 (at 7 ^iM) 
was allowed to equilibrate with 1 mM GDP or 1 mM guanosine 5'-0-(3- 
thiotriphosphate) (GTPaS) for 45 min at 25° prior to dilution into assay mixtures. 

30 Following association, 50 [il of a 1 :3 slurry of glutathione sepharose beads in association 
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buffer was added to each sample, and samples were mixed gently at 4° for 60 min. 
Beads were washed batch-wise with 5 x 500 ul association buffer and bound protein 
eluted at 100- for 5 min in 125 ul denaturation buffer (240 mM Tris-HCl, P H 6 8 20% 
glycerol, 2% ( w / v ) SDS, 0.5 M P -mercaptoethanol). Proteins were electrophoresed on 
5 1 2% sodium dodecyl sulfate-polyacrylamide gels and electroblotted onto polyvinylidene 
difluonde membranes. GST proteins were measured by staining membranes with 0 2% 
amido black. Membranes were blocked in 5% dry milk (w/v) in Tris-buffered saline 
0. 1% Tween-20 (TBS-T) and probed with either a 1:5,000 dilution of anti-Xpress 
antisera (Invitrogen®) or a 1 :2,000 dilution of anti-Ste4 antisera for 2 hr at 25° Blots 
10 were washed extensively with TBS-T and probed with a 1:10,000 dilution of horseradish 
peroxidase-conjugated donkey anti-mouse or anti-rabbit antiserum (Amersham®) for 1 
hr at 25°. After washing with TBS-T, blots were developed using the chemiluminescent 
substrate from Pierce* according to the manufacturers protocol. 

GTPaS Bindinp Ass ^ - A 6 pmol amount of purified His 6 -Gai2 or 
5 myristoylated Gail was incubated for 30 min at 25° either a,one or with 170 pmol GST 
or GST-AGS1 in the presence of 1.4 nmol GDP in a total volume of 280 pi assay buffer 
(50 mM Tris-HCl, pH 7.5, 150 mM NaCI, 0.6 mM ethylenediaminetetraacet. acid 5 
mM MgCl 2 , 0.01% Thesit). A 28 pmol amount of GTP^S (1.3 x 10< cpm/pmol) in 70 
Ml assay buffer was then added and 50 ul aliquots removed at the indicated times for 
filtrate onto nitrocellulose membranes. Filters were washed with 2 x 2 ml ice-cold 20 
mM Tris-HCl, P H 8.0, 100 mM NaCI, 25 mM MgCl 2 , air-dried, and bound counts 
determined in the presence of scintillation fluid. 

GTPase Activity Assays - Purified GST and GST-AGS1 or GST-Cdc42 (yeast) 
at 500 nM were incubated in 50 mM Tris, P H 7.4, 150 mM NaCI, 5 mM M g Cl 2 1 mM 
DTT, 0.01% Thesit with 5 uM y^-GTP at 25°. At the indicated times, 50 ul aliquots 
were removed into 750 ul of an ice-cold suspension of 5% charcoal (Norit®) in 50 mM 
NaH 2 P0 4 . Samples were vortexed, centrifuged, and 400 ul aliquots of supernatants 
removed for scintillation counting. 
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Guanine Nucleotide Binding Assays - Cultures of CY1316 (Agpal) and CY4600 
(AgpalAste4) carrying pYEX4Tl, P YEX4T1-AGS1 or pYEX4T 1 -CDC42 were grown 
in low phosphate medium (10 uM), and GST fusion proteins induced by the addition of 
200 uM CuSCy 30 min after copper addition, acid-free [ 32 P]-P0 4 was also added (50 

5 uCi/ml). Cells were grown an additional 5 hr, harvested, washed with cold water, and 
pellets quick frozen on dry ice. Thawed pellets were lysed by vortexing with beads in 50 
mM Tris, pH 7.4, 20 mM MgCl 2 , 50 mM NaCl, 0.5% Triton X-100, 1 mM DTT, 
protease and phosphatase inhibitors. Samples were centrifuged 10 min at 10000 rpm 
and supernatants bound to 150 ul glutathione sepharose for 2 hr at 4°. Columns were 

l o washed extensively in 50 mM Tris, pH 7.4, 20 mM MgCl 2 , 50 mM NaCl, 0. 1 % Triton 
X-100 and bound material eluted by heating to 70° in 50 mM Tris, pH 8.8, 20 mM 
reduced glutathione, 150 mM NaCl, 2% SDS, 20 mM EDTA, 2 mM GDP, 2 mM GTP. 
Eluted GST fusion proteins were quantitated by SDS-PAGE followed by staining gels in 
Coomassie blue, and equivalent amounts of GST fusion proteins were spotted onto PEI- 

15 cellulose plates. After allowing samples to dry, plates were washed extensively with 
ddH 2 0 followed by methanol, dried and developed in 1 M KP0 4 , pH 3.4. Dried plates 
were exposed to X-ray film for analysis. 

Abbreviations used herein are as follows: GPCR, G-protein coupled receptor; 
PAK, p-21 activated protein kinase; MAP kinase, mitogen activated protein kinase; 

20 RGS, regulator of G protein signaling; PCR, polymerase chain reaction; IGP, 

imidazoleglycerolphosphate; AT, 2-aminotriazole; SDS, sodium dodecyl sulfate; PVDF, 
polyvinylidene difluoride; GST, glutathione-S-transferase; TBS-T, Tris-buffered saline 
containing 0.1% Tween-20; DTT, dithiothreitol; GTPaS, guanosine-5'-o-(3- 
thiotriphosphate); CPRG, chlorophenolred-[J-D-galactopyranoside; and IPTG, isopropyl 

25 p-D-thiogalactopyranoside. 

EXAMPLE 1 : Yeast Host Strain for cDNA Library Screening 

Upon activation of the pheromone response pathway, Saccharomyces cerevisiae 
normally undergoes Farl -mediated growth arrest (Chang and Herskowitz (1990) Cell 
30 63:999-101 1; Peter and Herskowitz (1994) Science 265:1228-1231). Deletion of the 
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FA*, gene unco„p,es growm mesi ftom ptaomone ,„ duced 
Chang and Hersfcowtz (im CW,63:99,.,0, By fai,,^ 
he pheromone-inducible gene , 0 th£ ^ sequemes ^ ^ o 
dehydmase (encoded by „ e «, gene) can be regular by ac.iva.ion of 
5 pheromone response pamway (Price e, al. (1995) MoL Celt Biol 15:6188-6,95) 
indeed, ,his reg„,a,ed eons™, has proven nseft, m fc identWcatio „ ^ 
w.0, consttonve pheromone pamway aCvauon in the absence of a facional 
beta*** (Stevenson e, a!. (,992, Ge „„ A,v. 6:,293-,304; Stevenson e, a, (,995) 
^,9:2949-2963). Wo re , a « s M i„ casing ..e^.^ consIruct ^ 
be madeco„d.,io„a, for grow* upon pherotnone pathway activation by culm™ ,„is 
«- ^ medium .acting his.id.ne. In . sMar ^ ^, ^ ^ ^ ^ 

fused to P-gaiactosidase gene ( faZ) sequences, „d pheromone pathway activauon 
quanmated by coionmetric assays. ,„ practice, ^ fusl pron]oter ^ rf ^ 
consm.cs is often „„, enUreiy repressed in the absence of pheromone pathway 
■5 activationtKingeta, . (,990,5^ 250:,2 M 23, To c™,emc, mis in s^,, 

canying ,he«, Kpma , iow M 0 f AT can be added ,„ .be outure medium to 
Mly repress mS3 ac.ivi«y (Manfred, e. al. (,996) MoL Cm. Biol 16:4700-4709) 

In search for non-recep,„r modulators of me phenomena response pathway 
■ha. ac. ,„depe„dem,y of GPCRs, a yeas, slain ,acld„g ils ^ GpcR _ ^ ^ jK 

» ™ehe.ero n i m encGasubuni,,Gpa, ( CY,3,6;seeTab,e,)™su, 1 ,i 2 ed We 
mtroduced i„.o ,hi s s<rain ptemid CP,, 83, encoding a chimeric Ga cons,™, 
-mining me fa 4, amino acids of Gpa, msed „ ami„ oacids 36 . 356of humM Goi2 
Th,s emmeric cons.™ is capable of functionally coupling to me yeas, G Pr dimer 
■"lowing for full growth repression in me presence of 1 mM AT. 

mmSl-- Screening of. Human Liver cDNA Library for Pheromone 
Response Pathway Activators 

In vauda,ing the pheromore pamway activator screen a cDNA library derived 

flomadulthu^anliver^chos.n.rhehumanliv^libr.rywaschosenas.porenHa, 
genera, source of regulator, This cDNA libra^ was ligated into the higb-copy ^ 
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yeast vector pYES2. Sequences were ligated downstream of the yeast GAL1 promoter, 
making expression of cDNAs conditional upon the addition of galactose to the growth 
medium. The combination of a high-copy number vector and a strong galactose- 
inducible promoter increases the ability to detect even weak activators of the pheromone 
5 response pathway. A schematic of the pathway and yeast screen is shown respectively 

in Figs. 1A, IB, and 2. 

Yeast strain CY1316/1 183 was transformed with each of the libraries and 
transformation mixtures plated on a series of agar plates containing sucrose to select for 
primary transformants. Sucrose was used as a carbon source rather than glucose, as 

10 glucose actively represses the GAL1 promoter (Schneider et al. (1991) Meth. Enz. 

194:373-388). After allowing growth of primary transformants at 30° for 22 hr, replica 
plates were made onto galactose medium lacking histidine and containing 1 mM AT. 
Colonies appeared after approximately 2-3 d when grown at 30°. Those transformants 
unable to produce HIS3 still must deplete their intracellular stores of histidine prior to 

1 5 the onset of cell stasis, leading to background growth on the replica plates. By replica 
plating transformants before visible colonies have formed background growth is kept 
minimal, allowing for clearer selection of colonies that are true histidine prototrophs. If 
necessary, a second replica plating to identical selective medium can be performed to 
distinguish true histidine prototrophs from those colonies exhibiting delayed cell 

20 inviability. 

Growing colonies were recovered onto fresh sucrose plates containing histidine, 
and tested in a secondary screen on both glucose and galactose plates lacking histidine 
and containing 1 mM AT to identify galactose-dependent histidine prototrophs. Many 
of the colonies growing on the initial selection plates did not show galactose-dependent 

25 growth in this subsequent screen. These colonies may arise from several sources, 
including spontaneous AT resistance, genomic alterations to activate the pheromone 
response pathway, and spontaneous mutations leading to reversion of the genomic his3 
allele or constitutive activation of the FUS1 promoter. By expressing cDNAs under 
control of an inducible promoter, most colonies with plasmid-independent growth were 

30 eliminated by this rapid and simple secondary screen (see Table 2). 
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Plaids were iso, aW f rom SIrains galactose . depen(fcm ^ ^ 

«ed ,„ re-ran*™ „ aIve CYI3 16/] , 83 sM . ns TrmsfonMiiK w ^ 

sucrose medium, .hen ,epl,ca.p,a t ed _ g|ucoK ^ 

h,s»d,„e and containing , mM AT ^ ^ ta conferred 

fcpenden, grow* „„ , hls medium were ^ ^ 

pamway activator, Eigh, such ptanids were identified from these screens each 

coning te same ope. reading frame do„»tream of ,he GAL, promoter (designated 

LI, see Table 2). 
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All were found to contain the same open reading frame of 846 bases encoding a 
protein of 281 am.no acids, shown in Figures 3A and 3B (the nucleotide sequence of 
which is also shown in SEQ ID NO: 1 and the encoded amino acid sequence of which ,s 
also shown in SEQ ,D NO: 2). The open reading frame of these isolates was designated 
5 AGS, for Activator of G-protein Signaling. Each isolate contained varying amounts of 
5* and 3' untranslated sequence, and therefore appears to have been selected 
independently. A composite AGS nucleotide sequence containing the longest identified 
5' and 3' untranslated regions of AGS is shown in SEQ ID NO: 3. 

0 EXAMPLE3 : AGS-Mediated Activation of the Phenomena Pathway Occurs 
at the Level of the Heterotrimeric G-Protein 

Strains carrying P YES2-AGS1 exhibit galactose-dependent growth as well as 
galactose-dependent reporter gene activity (in strains also carrying the FUSlp-lacZ 
construct). The galactose-dependent growth phenotype conferred upon strain 
= CY1316/CP1 183 by P YES2-AGS was further characterized by genetic analysis. 
PYES2-AGS fails to confer galactose-dependent growth on medium lacking histidine 
and containing 1 mM AT to strains CY4600, CY12444 and CY12970, containing, 
respectively genomic disruptions of STE4, STE5, and STE20 (see Table 1). This lack of 
growth is consistent with AGS activation of signaling occurring through the pheromone 
response pathway (G P? - PAK -> MAPK cascade). Neither were the isolates of strains 
carrymg an episomal copy of human RGS4 under control of the constitutive PGK1 
promoter able to confer galactose-dependent growth. Furthermore, the inability of AGS 
to confer growth on a strain deleted for G P is consistent with AGS-mediated activation 
occurring at or above the level of the heterotrimeric G-protein. 

Plasmid pYES2-AGS was also unable to confer galactose-dependent growth on 
medium lacking histidine and containing 1 mM AT to strain CY1316/CP5533, carrying 
the G204A allele of GPAJ ( j. 4}J . Gai2 . Three independent isolates each of strains 
CY1316/ P YES2 and CY1316/pYES2-AGS carrying plasmid CP5533 (G204A) and one 
isolate each of strains CYI316/pYES2 and CY1316/ P YES2-AGS carrying plasmid 
CPl 1 83 (WD were grown on sucrose medium for 2 d. Equal numbers of cells were 
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then resuspended in sterile H2O and spotted onto galactose medium lacking histidine and 
containing ImM AT. Cells were grown 2 d at 30° prior to photography to visualize 
growth. Neither the isolates of strains CY1 3 16/pYES2 or CY1 3 16/pYES2-AGS 
carrying plasmid CP5533 (G204A), exhibited growth above that seen with control strain 

5 CY1316/pYES2 carrying plasmid CP1 183 (WT). By contrast, positive control strain 
CY1316/pYES2-AGS carrying plasmid CP 1 183 (WT) exhibited significant colony 
growth. The G204A mutation is analogous to the G226A mutation in Gas which 
renders this protein unable to form a high affinity Mg 2+ »GTP complex (Lee et al. 
(1992) 1 Biol Chem. 267:1212-1218). The G204A mutation in the chimeric Gpal(i- 

10 4i)-Gcti2 also appears to render it incapable of activation, as evidenced by its failure to 
transduce a receptor-mediated signal to activate fuslp-HIS3 transcription. 

To rule out that AGS expression was affecting the expression of the Gpal(i-4i)- 
Gai2 chimera, membrane preparations from strains CY1316/CP1 183 and 
CY13 16/CP5533 expressing either the pYES2 vector or the pYES2-AGS plasmid and 

15 grown for 20 hr in galactose-containing medium were analyzed by immunoblot analysis 
using antisera specific for the C -terminus of Gai2. No significant differences in 
chimeric Gpal(i-4i)-Gai2 expression were detected in any of these strains. 

These data strongly suggest that AGS activation of the pheromone response 
pathway is occurring at the level of the heterotrimeric G protein. Furthermore, the 

20 inability of AGS to activate the pheromone response pathway in the presence of a Ga 
that is defective in GDP-GTP exchange, and the opposing effects of AGS function and 
RGS4 function on pheromone pathway activation strongly suggest that AGS functions 
by facilitating GTP exchange on the heterotrimeric Ga. 

Finally, the Ga selectivity profile of AGS 1 was determined. The pYES/FUSlp- 

25 lacZ transformants in strain CY1316 (no Ga), or CY13 16 derivatives carrying an 

integrated copy of a GPAl-human Gai2 fusion construct (CY1 141), a GPAl-human Ga 
i3 fusion construct (CY7967), a rat Gas construct (CY8342), a Gi^Mwman Gal6 
fusion construct (CY9603), or yeast GPA1 (CY9571) (all under the control of the yeast 
GPA1 promoter) were grown in selective liquid medium containing either glucose or 
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galactose. Relative P-gaiactosidase activity was then determined on aliquots of 
permeablized cells (Table 3). 



TABLE 3 

P-galactosidase activity 
Strain VECTOR AGS1 

Glucose Galactose Glucose Galactose 



15 



Gas 



NoGa 103.4+/- 5.7 76.0+/- 6.9 145.0+/- 5.9 140.1 +/- 4.4 

Gai2 H.1 +/-0.9 8.7+/- 0.5 5.6+/- 0.6 124.8+/-4.1 

Gai3 10.0+/- 0.6 9.8+/- 0.6 5.6+/- 0.7 63.3 +/- 7.8 

13.3+/- 10 11.9+/- 0.8 10.1+/- 0.5 6.4+/- 1.1 

20 Gal6 23.6+/- 0.7 24.3+/- 1.6 22.5+/- 1.2 22.5+/- 1.6 

Gpal 5.6+/- 0.3 5.4+/- 0.3 3.6+/- 0.5 7.6+/- 0.8 

Analysis of the Go. selectivity profile for AGS 1 in yeast strains carrying afuslp- 
25 lacZ reporter construct and expressing different mammalian Ga constructs indicated Gai 
specificity. Expression of AGS1 resulted in high levels of (3-galactosidase activity in 
strams expressing GPAl^GaV or M1) -Gai3, but not in strains expressing Gas, 
G/^7 (M „-Gal6 or GPA1 itself (Table 3). 

Immunoblot analysis using polyclonal serum against Gpal and Gas indicated 
30 that each of the host strains expressed equivalent amounts of Ga and that expression of 
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AGS1 had no significant effect on the steady-state levels of any of these Ga proteins. 
Similar experiments indicated that expression of AGS1 had no significant effect on the 
steady-state levels of yeast Gp. 

5 EXAMPLE 4: A Counter-Screen for Pheromone Pathway Inhibitors 

With the identification of AGS 1 as a receptor-independent mammalian activator 
of the yeast pheromone response pathway, a novel screen was devised to identify 
pheromone pathway inhibitors. For this screen, a strain was created that had its 
pheromone pathway constitutively activated by AGS1 . Then, a human liver cDNA 

10 library in vector pYES2 was introduced into this strain in an attempt to identify cDNAs 
that, when expressed, counteracted AGS1 function. This screen, therefore, can identify 
proteins that both directly and indirectly regulate AGS1 activity. 

In this screen, the marker gene CAN1 was employed which encodes a yeast 
arginine permease that can transport the arginine analog canavanine (Sikorski (1991) 

15 Meth. Em. 194:302-318). Accordingly, cells expressing Canl and cultured in the 
absence of arginine can transport canavanine and incorporate it into nascent proteins, 
leading to cell death. By fusing CAN1 coding sequences downstream of the FUS1 
promoter and introducing it into yeast strain CY1 141 (Table 1), this strain can be made 
conditionally non- viable upon pheromone pathway activation. Into this strain the AGS1 

20 coding sequences fused downstream of the constitutive yeast promoter PGKJ were 

introduced (Schena et al. (1991), Meth. Enz. 194:389-398). Expression of AGS1 in this 
strain led to constitutive pheromone pathway activation, as evidenced by growth on 
medium lacking histidine and inhibition of growth on medium lacking arginine and 
containing canavanine. 

25 The observation was made that the constitutive expression of human RGS4 

interfered with AGS1 function. Thus, RGS4 was ligated downstream of the GAL1 
promoter in pYES2 and used as an initial proof-of-concept test for the inhibitor screen. 
When introduced into CY1 141 carrying PGKlp- AGS1 and FUSlp-CANl expression 
constructs, pYES2-RGS4 conferred galactose-dependent growth on medium lacking 

30 arginine and containing canavanine. The amount of canavanine used in this test (200 
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fig/ml medium) was determined to be optimal in suppressing the growth of pYES2 
vector transformants without causing general lethality. Therefore the strain was 
transformed with the human liver cDNA library and a inhibitor screen analogous to the 
activator screen was carried out. A schematic of the inhibitor screen is shown in Figure 
5 2 and the screen characteristics are shown in Table 2. A higher level of plasmid- 
independent colony formation was expected from this screen relative to that of the 
activator screen, as any genomic mutation that abolishes pheromone pathway signalling 
should confer growth under restrictive conditions. This background, again, was easily 
eliminated with a secondary screen comparing growth on glucose medium versus growth 
0 on galactose medium. In addition, a mgher level of general background growth in this 
screen was encountered compared with the activator screen. An additional replica- 
plating step in the secondary screen eliminated much of this background (see Table 3). 
In the initial screen of 1.3 x 10' primary transformants, a single cDNA clone (L15) was 
isolated that was capable of conferring growth in a galactose-specific manner under 
selective conditions. Sequencing of the 1 .7 kilobase cDNA insert (SEQ ID NO: 24) 
revealed it to encode human RGS5 (SEQ ID NO: 25). Human RGS5 is an ortholog of 
mouse RGS5, which has been shown to bind in vitro to both Gai and Gao (Chen et al. 
(1997) J. Biol. Chem. 272:8679-8685). The ability to isolate one member of a family of 
known down-regulators of heterotrimeric G-protein signaling pathways not only serves 
to validate this inhibitor screen, but also to reinforce the mechanism of action of AGS1. 

In summary, the functional redundancy found in many key regulatory pathways 
of both yeast and mammalian cells allows for the development of screens designed to 
identify specific modulators of these pathways. The ability to isolate both mammalian 
activators and inhibitors of the yeast pheromone response pathway demonstrates the 
utility of these yeast-based functional screens. Moreover, the short time frame required 
for each screen (1-2 weeks), as well as the relatively high throughput (Table 3), allows 
for rapid saturation screens for a cDNA library of interest. Accordingly, the 
implementation of these screens using cDNA libraries generated from different normal 
or disease-specific tissue sources can be used to identify other novel pathway activators 
or inhibitors. 
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EXAMPLE 5 ; Characterization of Isolated cDNA Clone 

The open reading frame of the 8 isolated clones encodes a 281 amino acid 
protein with homology to small ras-like G proteins. This newly cloned protein was 
termed AGS. A search of GenBank revealed that AGS exhibits homology (96% at the 

5 amino acid level, 85% at the nucleotide level) with a previously identified 

glucocorticoid-induced ras-related protein in mouse (accession number AF009246). 
Sequence comparison with the yeast genome indicated that AGS shares the highest 
degree of homology with RSR1/BUDJ (45% amino acid identity within amino acids 25- 
125 of AGS) and both RAS1 and RAS2 (43% amino acid identity within amino acids 25- 

10 125 of AGS). An alignment of AGS with representatives of all major classes of small G 
proteins in humans (Figures 4A and 4B) indicates that AGS is likely to be the founding 
member of a novel class of small G proteins in humans. 

Though AGS is clearly a member of the ras superfamily, it has several unique 
structural features, including both N- and C-terminal extensions not seen in most small 

15 G proteins. AGS also has alterations in several amino acids that are normally highly 
conserved in all small G proteins. Indeed, AGS shares alterations at three key amino 
acid residues with the recently identified G proteins Rndl, Rnd2 and RhoE/Rnd3 (Foster 
et al. (1996) Mol. Cell. Biol. 16:2689-2699; Nobes et al. (1998) J. Cell Biol. .141:187- 
197) (see Figure 5). RhoE was shown to be deficient in GTP hydrolysis activity, and 

20 this deficiency could be restored by alteration of these three amino acids to their highly 
conserved counterparts (Foster, R. et al. (1996) Mol. Cell. Biol 16:2689-2699). Though 
AGS shares these three amino acid alterations with Rndl, Rnd2 and RhoE/Rnd3, it does 
not share the consensus rho effector (amino acids 42-50 in Rndl) or rho insert (amino 
acids 132-140 in Rndl) domains of these proteins. AGS, therefore, does not appear to 

25 be a member of the rho family of ras-like proteins. 

EXAMPLE 6 : Mutational Analysis of AGS Protein 

A series of point mutations were created in AGS to analyze their effects on 
pheromone pathway activation, as measured by galactose-dependent growth in the 
30 absence of histidine. Mutations at conserved glycine residues 3 1 and 36, which are 
predicted to be in the P site, render AGS unable to confer histidine prototrophy to cells. 
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Surprisingly, mutation of another conserved glycine in the G' site, glycine-81 , does not 
appear to affect the function of AGS, even with growth on medium containing 20 mM 
AT. This glycine mutation is analogous to the G(226)A mutation of Gas and the G204A 
mutation of Gai2, and would normally be predicted to by an inactivating mutation. 

A number of other mutants (listed below in table 4) were tested in the vector 
PYES2 in CY1 141 on medium lacking histidine and containing 1 mM aminotriazole as 
described above. The (3-galactosidase activities (relative to wild-type AGS1) of all of 
the mutants tested are as follows. 
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Construct 



- 105 

13-gal activity (glucose) 
8.5 
11.1 
10.3 
9.3 



Wild type 
G(31)V 
5 G(36)V 
G(81)A 

S(33)G 8.4 

S(80)A 10.1 

N(82)G 12.2 

10 F(44)N not tested 

S(33)G; S(80)A 9.3 

S(33)G; N(82)A 10.1 

S(80)A; N(82)G 9.9 
S(33)G; S(83)A; N(82)G 7.7 

15 S(33)G; S(80)A; N(82)G; 

G(81)A 12.0 

K(225)E; K(226)E 13.0 

C(278)S 14.4 

Vector 10.1 

20 



ft-gal activity (galactose) 

100 

8.2 

10.9 

69.6 



68.6 
104.0 

66.0 
not tested 
39.1 
37.2 
92.6 
71.1 

125.5 

81.0 

16.4 

8.9 



The G(31)V mutant carried a glycine to valine alteration at residue 31 (G31V), 
an absolutely conserved residue in the P-loop of monomelic G proteins that makes 
critical contacts with the a- and P-phosphates of both GDP and GTP (Valencia et al. 
(1991) Biochemistry 30:4637-4648). The C(278)S mutant carried a cysteine to serine 

25 alteration at residue 278 (C278S) within the C-terminal CAAX box. This cysteine is a 
major site of lipid modification for most members of the ras superfamily. Neither of 
these mutations appeared to destabilize the AGS1 protein, as immunoblot analysis of 
both N-terminally His 6 - and GST-tagged mutant proteins showed no significant change 
in the steady-state levels relative to wild-type AGS1 . The introduction of either of these 

30 two amino acid changes in AGS 1 , however, severely impaired its ability to activate the 
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pheromone pathway consistent with AGS 1 function rearing both guanine nucleotide 
binding and post-translational lipid modification. 

EXAMPLE 7; Effect of STE14 on AGS Function 

5 The yeast strain used in the isolation of AGS contained a genomic disruption of 

the STEI4 locus, which encodes a farnesyl cysteinexarboxyi methyltransferase known 
to carboxymethylate the yeast Get, StelS, Rasl and Ras2 proteins as well as a-factor 
(Hrycyna and Clarke (1990) Mol. Cell. Biol. 10:5071-5076; Marr et al. (1990) J. Biol. 
Chem. 265:20057-20060; Hrycyna et al. (1991) EMBO J. J_0: 1 699-1 709). Disruption of 
.0 s t el4 has been found to reduce background signaling through the pheromone response 
pathway in strains carrying chimeric Ga constructs. Because AGS appears to encode a 
small ras-related G protein with a C-terminal CaaX box sequence consistent with 
farnesylation, and because small G proteins are often carboxymethylated after 
famesylation, we examined the effect of addition of an episomal copy otSTEU on AGS 
5 function. STEM gene expression appeared to enhance the fusJ -HIS3 mediated growth of 
strains expressing AGS, though an increase ,n background growth on glucose medium 
was also evident. 



EXAMPLE 8 : Tissue Expression of AGS mRNA 

The tissue-specific expression of AGS in mRNAs isolated from various human 
tissues was measured. A 32 P-,abeled DNA probe was made from the full length coding 
sequence of AGS and hybridized under stringent conditions to mRNA Northern blots 
and an mRNA dot blot generated from human tissues and human cancer cell lines. The 
AGS probe hybridized to a major transcript at approximately 2 kb and a minor transcript 
at approximately 5 kb. The major transcript hybridized most strongly with mRNAs 
derived from liver, skeletal muscle, heart, kidney, brain and placenta, prostate and bone 
marrow tissues, as well as from a HeLa cell line. Interestingly, the relative ratio of the 
major and minor transcripts varies with tissue source, and the major transcript is almost 
entirely absent from mRNA derived from fetal liver. Upon longer exposure times 
hybridization to the AGS probe is detectable in all mRNA samples, probably indicating 
low-level of AGS expression in all tissues. 
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EXAMPLE 9 : Chromosomal localization of AGS1 

In a further characterization of the AGS 1 gene, the chromosomal locus of AGS 1 
was determined according to the protocol of Genome Systems, Inc. (St. Louis, MO). A 

5 full length AGS1 cDNA (SEQ ID NO: 1) was used to identify a PI genomic clone from 
a human library. This PI clone ; with fluorescence in situ (FISH), localized the AGS1 
gene to chromosome 17 in band 17pl 1.2. AGS1 sequences were localized to 38% of the 
distance from the centromere to the telomere of chromosome arm 17p, and 71 out of 80 
metaphase chromosome spreads that were analyzed exhibited specific labeling with an 

10 AGS 1 probe. The homolog of AGS 1 has been (as described in Example 1 5) localized to 
chromosome 22 in band 22ql3.1. A PI genomic clone carrying AGS1 has been 
analyzed by PGR amplification using the oligo sequences 
5 ' TTCTCGCGG ATGTAC ATG A3 3 ' (SEQ ID NO: 43) and 

5'TCCACCGCAAGTTCTACTCC3' (SEQ ID NO: 44) and verified, by size analysis, to 
1 5 contain an AGS 1 DNA sequence. 

EXAMPLE 10 : Expression of AGS Protein in Host Cells 

Coding sequences for AGS were amplified by PCR for expression in yeast, E. 

coli, insect, and mammalian cells. These sequences were ligated downstream of coding 
20 sequences for either GST or His6 epitopes, and protein expression levels were monitored 

by immunoblot analysis using antisera directed against the respective fusion partners. 

Inducible expression of GST-tagged AGS in yeast and bacteria was easily detectable. 

The expression of mutant forms of AGS in yeast, in which glycine residues 36 and 81 

were changed, respectively, to valine and alanine was also detectable, as was the 
25 expression of His6-tagged AGS in insect cells. A His6-tagged version of AGS was also 

introduced into the mammalian cell line HEK293, and stable transfectants generated. 

The expression of tagged AGS mRNA in these cell lines was detectable, but expression 

of AGS protein was not, indicating that AGS protein may be unstable in this cell line. 
However, both wild type and AGS1 mutant G(31)V have been expressed and 
30 detected in mammalian CHO-K1 and COS-7 cells and both polypeptides are membrane 

associated. No AGS1 protein is detectable in the soluble fraction. In addition, in yeast, 
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bound at 40 min in the absence of AGS 1). AGS1 also was able to enhance GTPyS 
binding to a similar extent on purified brain heterotrimeric G-protein, where the majority 
of Get in the heterotrimer is Gao, suggesting AGS1 also functions on Goto and functions 
independently of the presence of Gpy. These observations, in combination with the in 

5 vivo epistasis and Ga selectivity data, indicate that AGS 1 likely functions by catalyzing 
guanine nucleotide exchange on selected Ga proteins. Purified GST- AGS 1 alone does 
not bind GTPyS under these conditions, nor does it bind GTPyS when presented in the 
absence of excess GDP. Two control experiments were done to assess where the 
labelled GTPyS was being bound in these assays. First, Ni-NTA agarose and glutathione 

10 sepharose were used to pull out either His 6 -Gai2 or GST-AGS 1 after 40 min, and the 
majority of GTPyS counts were found on the Ni-NTA column. Second, purified Gcti2 
was pre-loaded with non-labelled GTPyS, dialyzed, and used in the assay. This led to an 
approximately 70% decrease in label bound. The converse experiment, in which 
purified GST- AGS 1 was pre-loaded with cold GTPyS prior to running the assay showed 

15 only an approximately 1 5% decrease in label bound. 

EXAMPLE 13: Induced Expression of AGS-1 Activities ERK1/ERK2 Pathway 

AGS1 and a His 6 -tagged version of AGS1 have also been constructed in the 
ecdysone (ponasterone)-inducible mammalian expression vector pIND (Invitrogen). 
20 using standard techniques. Upon induction, assays for determining activation of MAP 
kinase pathways were performed on cells carrying inducible pIND-AGSl constructs. 
These assays indicated that in HEK293 cells, the ERK1/ERK2 pathway is activated 
upon induction of expression of AGS1 and that this activation is abolished by 
pretreatment of cells with pertussis toxin (Ptx) (Table 5). 
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TableS 



ERK1/ERK2 activation (cpm) +/- S.F 



21710+/- 1530 



32070+/- 1270 



M. 



pIND-AGSl+Ptx 23700+/- 960 
(uninduced) 



pIND-AGSl+Ptx 26460+/- 400 
15 (induced) 

EXAMPLE": GTPasc Analysis of AGS1 
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after purification from strain CY4600 (pathway off). AGS1 ? however, seemed to be 
bound only to GTP when purified from either strain background — no GDP was detected 
at all in either sample. Relative to Cdc42, it is estimated that less than 1% of AGS1 is 
bound to nucleotide after purification. 
5 Taken together, the above biochemistry suggests that AGS1 can only bind 

guanine nucleotides poorly and has lower affinity for GDP as compared to GTP. Given 
that its GTPase activity is comparable (mohmol) with Cdc42, this would suggest that 
purified AGS1 may have a higher than normal hydrolysis activity. Accordingly, it is 
likely that only a small fraction of purified AGS1 is active, accounting for the relatively 
10 low stimulation of GTPyS binding seen when used in the activation assay and for its 
apparent instability. 

EXAMPLE 15 : Cloning and Analysis of a Human AGS1 Homoiog. 

By sequence comparison, a related human AGS1 homoiog was identified from 

15 the GenBank database (Accession No.: CAA18456). Alignment of the AGS1 

polypeptide sequence to the AGS1 homoiog indicated that these proteins were 62% 
identical when comparing residues 5-280 of AGS1 (SEQ ID NO: 2) to residues 12-227 
of the AGS1 homoiog (SEQ ID NO: 41). In addition, the cDNAs encoding these 
proteins were 82% and 84% identical when comparing AGS1 nucleotide bases 102-438 

20 and 510-646 of SEQ ID NO: 1 with, respectively, AGS1 homoiog nucleotide bases 123- 
459 and 531-667 of SEQ ID NO: 40. The overall greatest divergence between these two 
molecules is found in the C-terminal or 3' region of these molecules. 

Given the above sequence similarity between AGS1 and the AGS1 homoiog, a 
functional analysis of these related molecules was conducted.. The cDNA for the AGS1 

25 homoiog was amplified by PCR from a human liver cDNA library and subcloned into 
pYES2 vector using engineered 5'BamHI and 3' EcoRI sites (as had been done for 
AGS1 itself). Expression of this homoiog did not confer growth to CY1316/1 183, nor 
did it lead to FUSlp-lacZ activity in CY1 141 (see Table 6). A common internal MscI 
site in both AGS 1 and the AGS1 homoiog was taken advantage of in order to exchange 

30 the C-terminal regions between these two sequences. The chimeric expression 

constructs were called AH (for AGS1 head and Homoiog tail) and HA (for Homoiog 
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Construct 



3 



Functional Analysis of AGS1, AGS! Ho m „. og , and Chimeric 
Proteins 
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1 . An isolated nucleic acid molecule which encodes an AGS protein, 
comprising a nucleotide sequence having at least 86% identity to the nucleotide 

5 sequence of SEQ ID NO: 1 , or the complement thereof. 

2. The isolated nucleic acid molecule of claim 1, which comprises a 
nucleotide sequence having at least 90% identity to the nucleotide sequence of SEQ ID 
NO:l or SEQ ID NO:3, or the complement of SEQ ID NO: 1 or SEQ ID NO: 3. 

10 

3. The isolated nucleic acid molecule of claim 1 , which comprises a 
nucleotide sequence having at least 95% identity to the nucleotide sequence of SEQ ID 
NO:l or SEQ ID NO:3 5 or the complement of SEQ ID NO: 1 or SEQ ID NO: 3. 

15 4. The isolated nucleic acid molecule of claim 1, which comprises the 

nucleotide sequence of SEQ ID NO:l, or the complement thereof. 

5. The isolated nucleic acid molecule of claim 1 , which comprises the 
nucleotide sequence of SEQ ID NO: 3, or the complement thereof. 

20 

6. The isolated nucleic acid molecule of claim 1 , which encodes a protein 
that activates G protein-coupled signal transduction in a G protein-coupled receptor 
independent manner. 

25 7. The isolated nucleic acid molecule of claim 1 , which is a human nucleic 

acid molecule. 

8. An isolated nucleic acid molecule comprising a nucleotide sequence 
encoding a protein which comprises an amino acid sequence having at least 97% identity 
30 to the amino acid sequence of SEQ ID NO:2. 
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9. The isolaied nucieic acid molecule of ^ „ ^ ^ 
conges . amino acid sequence hlving al leas , 98% 

sequence of SEQ ID NO:2. 

10. The isolate, nucleic acid molecule of da™ 8 , wherein u,e protein 
-pn.es an amino acid sequence having a, ,« „ % ide„ tily l0 fc mino 
sequence of SEQ ID NO:2. 



11. ^-'""-leicacidmotaieofeiatas,^,,^ 
10 comprising the amino acid sequence of SEQ ID NO:2. 

12. The rsolated nucleic acid molecule of data 8, wherein said proreh, 

independent manner. P 



15 



acid molecule. 



■3. ^ i »^»^.cidmo,ec„ 1 e„fc,aim8,whichi sah » ma , nudeic 



14. A 



20 



vector comprising the nucleic acid molecule of claim 



15- Thevectorofclaun 1 4, which is a recombinant expression vector. 
16. A host cell containing the vector of claim 14. 

" cellofcL^^ 

of clann 16 m a suable medium such that AGS protein is produced. 

from J" „■ ThCmeth0d0fC ^ 
from the medium or the host cell. 

30 
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19. A nonhuman transgenic animal which contains cells carrying a transgene 
encoding an AGS protein. 

20. An isolated AGS protein comprising an amino acid sequence having at 
5 least 97% identity to the amino acid sequence of SEQ ID NO:2. 

2 1 . The isolated protein of claim 20, comprising an amino acid sequence 
having at least 98% identity to the amino acid sequence of SEQ ID NO:2. 

10 22. The isolated protein of claim 20, comprising an amino acid sequence 

having at least 99% identity to the amino acid sequence of SEQ ID NO:2. 

23. The isolated protein of claim 20, comprising the amino acid sequence of 
SEQ ID NO:2. 

15 

24. A fusion protein comprising at least a portion of the AGS protein of claim 
20, operatively linked to a non-AGS polypeptide. 

25. An antibody that specifically binds the AGS protein of claim 20. 

20 

26. The antibody of claim 25, which is monoclonal. 

27. The antibody of claim 25, which is labeled with a detectable substance. 

25 28. A pharmaceutical composition comprising the protein of claim 20 and a 

pharmaceutical^ acceptable carrier. 

29. A pharmaceutical composition comprising the antibody of claim 25 and a 
pharmaceutically acceptable carrier. 

30 
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30. A m e thod for iden[ifying a compouii() ^ mduiatK 

in a cell, comprising: """ion 

•»**« a cdl ta expresses an AGS pro,e,n wi.h a ,es, compound- 
^ ^ deKrai " i ^^««>of.he, e s 1 c„ mp ou nd „„, he ac l iv iVoftheAGSprolein; 

id-as*, te „, compound _ , modulator of ^ 

of .he compound t0 ^ fa ^ rf ^ ^ ^ ^ 

,» ac irf J heM "° fC '" m30 ' Wherei " ,heAGS P«>'- -mprises an a^ino 

«- ^ence ha» ing a, ,e,s, 97% idenn* 10 SEQ ID N0: 2 ffid slimulalK G ^ 

aamty m a receplor-independen. manner. 

32- Tie memod „f claim 30> ^ AQS ^ ^ ^ 

" ,c„ Ueme,hOd ° fC ' ai, " 30 ' W ' ,erem,heAGS ^Prises .heam.no 

acid sequence of SEQ ID NO: 2. «= amino 

d. *rT • T " n,e,h< " 1 ° f daim * WhCrei " ' he Ce " te - «P-s 



35. 



H» melhod of dam, 30, wherein .he cell ba s fi,^, tem ^ 
express a G protein a subunit. 



25 36. 



The method of claim 30, wherein the eel, is a yeast ce„ that has been 
—dtoexpressam^ 

pheromone response pathway in the yeast cells. 
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37. The method of claim 36, wherein the yeast cell has been engineered to 
express a Gpal-Gai2 chimeric G protein a subunit. 

38. The method of claim 36, wherein the pheromone response pathway in the 
5 yeast cells is monitored by measuring the activity of a pheromone responsive promoter 

in the yeast cells. 

39. The method of claim 30, wherein the effect of the test compound on the 
activity of the AGS protein is determined by monitoring the ability of the test compound 

10 to bind to the AGS protein. 

40. The method of claim 30, wherein the effect of the test compound on the 
activity of the AGS protein is determined by monitoring the ability of the test compound 
to modulate the interaction of the AGS protein with a target molecule. 

15 

41 . The method of claim 40, wherein the target molecule is a G protein. 

42. A method for modulating G protein coupled signal transduction in a cell 
comprising contacting a cell with an agent which modulates AGS protein activity or 

20 AGS nucleic acid expression such that G protein coupled signal transduction is 

modulated in the cell, when compared with G protein coupled signal transduction in the 
cell in the absence of the agent. 

43. The method of claim 42, wherein the cell-associated activity is a G- 
25 protein mediated activity. 

44. The method of claim 42, wherein the agent stimulates an AGS protein 
activity or AGS gene expression. 

30 45. The method of claim 42, wherein the agent is an active AGS protein. 
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AGS that has been introduced into the cell. 



47. The method of clai m 42, wherein the agent inhibits an AGS protein 
5 activity or AGS gene expression. 



49. The method of claim 42, wherein the agent is an antibody that 
specifically binds to an AGS protein. 



50 ^methodofclai m 42,whereinthecelli S p re s e ntwithinasubjectand 
the agent ,s administered to the subject. 



15 



20 



51. Amethod for , re . ting a subject having a d i5orfer cWeh.ed by M 
subject an AGS modulator such that treatment of the subject occurs. 

samp,. A meth ° d f " ^ ^ ° f " AGS * " 

-Pie compnsmg contacting a biologica, S »p,e wit, an agent cap* of detec ,„g the 

AOS pr„«,„ or mRNA such tha, the presence of the AGS ptotein , s deKctt d i„ me 

biological sample. 



53 The method of claim 52, wherein the agent is a labeled or labelable 
nuclei acid probe capable of hybridizing to an AGS mRNA. 

54. ^methodofclaim52,whereinth e agentisalab e ledorlabe.able 
anttbody capable of specifically binding to an AGS protein 

30 
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55. A method for identifying a compound that activates a signal transduction 
pathway in a cell, said method comprising, 

providing a cell that undergoes a measurable change when the signal transduction 
pathway is activated; 
5 contacting the cell with a test compound; and 

determining if the test compound causes a measurable change to 

thereby identify the test compound as a modulator of the signal transduction 
pathway. 

10 56. The method of claim 55, wherein said signal transduction pathway is a G 

protein signaling pathway. 

57. The method of claim 56, wherein said ceil is a yeast cell. 

15 58. The method of claim 57, wherein said G protein signaling pathway is the 

yeast pheromone response pathway. 

59. The method of claim 58, wherein said compound is a polypeptide 
encoded by a nucleic acid molecule. 

20 

60. The method of claim 59, wherein said nucleic acid comprises a nucleotide 
sequence having at least 70% identity to the nucleotide sequence of SEQ ID NO:l or 
SEQ ID NO:3. 

25 61 . The method of claim 59, wherein said nucleic acid comprises a nucleotide 

sequence having at least 80% identity to the nucleotide sequence of SEQ ID NO:l or 
SEQ ID NO:3. 

62. The method of claim 59, wherein said nucleic acid comprises a nucleotide 
30 sequence having at least 90% identity to the nucleotide sequence of SEQ ID NO: 1 or 
SEQ ID NO:3. 
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63. The method of clai m 59, wherein said nucleic acid comprises a nuCeotide 

^T N or at least 95% identity to the nuc,eotide sequence ° f seq id N ° :i ° r 



64. The method of claim 59, wherein said nucleic acid comprises the 
nucleotide sequence of SEQ ID NO:l. 

65. The method of claim 59, wherein said nucleic acid comprises the 
1 0 nucleotide sequence of SEQ ID NO: 3 

66. The method of claim 59, where.n said polypeptide activates G protein 
ngnalaig in a G protein coupled-receptor independent manner. 

67 ™ e method of claim 59, wherem said nucleic acid comprises a nucleotide 
sequence which is a human. 

68. The method of claim 59, wherein said polypeptide comprises an amino 
actd sequence having at least 97% identity to the amino acid sequence of SEQ ID NO:2. 

69. ^e method ofclaim 59, wherein the polypeptide comprises an amino 
acd sequence having at least 98% identity to the amino acid sequence of SEQ ID NO:2. 

70. The method of claim 59, wherein the polypeptide comprises an amino 
actd sequence having at least 99% identity to the amino acid sequence of SEQ ID NO:2. 

71. The method of claim 59, wherein the polypeptide comprises the amino 
acid sequence of SEQ ID NO:2. 



72. The method of claim 30, wherein the compound is a nucleic acid 



20 



encoding a polypeptide capable of inhibiting the activity of the 



AGS protein. 
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73. The method of claim 72, wherein said nucleic acid comprises the 
sequence provided in SEQ ID NO: 24. 

5 74. The method of claim 72, wherein said nucleic acid encodes the 

polypeptide sequence provided in SEQ ID NO: 25. 

75. The method of claim 30, wherein the cell further comprises a nucleic acid 
encoding an inhibitor of the AGS protein. 

10 

76. The method of claim 75, wherein said nucleic acid comprises the 
sequence provided in SEQ ID NO: 24. 

77. The method of claim 75, wherein said nucleic acid encodes the 
15 polypeptide sequence provided in SEQ ID NO: 25. 

78. The method of claim 75, wherein said ability of the test compound to 
modulate the activity of the AGS protein indicates that the test compound is a modulator 
of the inhibitor of the AGS protein. 

20 
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SEQUENCE LISTING 

<110> Duzic, Emir et al . 

<»0. AOS PROTEINS AND NUCLEIC ACID MOLECULES AND USES THEREOF 

<130> CPI - 086CPPC 



<140> 
10 <141> 

<160> 72 



15 



20 



<170> Patentln Ver. 2.0 

<210> l 
<211> 846 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (i) . , {343} 



25 <400> 1 



40 



144 



192 



240 



K IT. 2 E K 2 K "I "! 2 * £ 2 - 2 K " 

25 30 

tec aag gtg ggc aag acq qcc arr rn-n ^ 
3= 3« ,,. V„ „, Ly , T J J.' « S ~ «« « -c „ J. 

40 45 

^ fx" £ £ £ ^ ^ f~ "c c ac cgc aag ttc 

50 y r ° Thr Ile Glu As P Phe His Arg Lys Phe 

bb SO 

£ - s G 93 y - 9 *s T ; r c G c - r r r ctc 9ac - 

65 70 P Ile Leu As P Thr Ser 

E E £ 2 £ 2 E 2 £ S S 2 S £ 2 5 !8S 

90 95 

0 K va? £ £ v 9 I? £ £ Asp r ^ "« » S 

100 SP Asn Ar 9 Asp Ser Phe Glu 

105 110 

■ s s a s e s s = 2 2 - 2 £ s; 2 
"i jjf s 2 s: 2 a % s 2 a s s s ~ - 

135 140 
- a ag ggc gac cgc gac fctc ^ ^ ^ ^ ^ ^ 
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- 2 - 

Asn Lys Gly Asp Arg Asp Phe Tyr Arg Glu Val Asp Gin Arg Glu lie 

145 150 155 160 

gag cag ctg gtg ggc gac gac ccc cag cgc tgc gcc tac etc gag ate 528 

5 Glu Gin Leu Val Gly Asp Asp Pro Gin Arg Cys Ala Tyr Phe Glu lie 

165 170 175 

teg gcc aag aag aac age age ctg gac cag atg etc cgc gcg etc tzc 576 

Ser Ala Lys Lys Asn Ser Ser Leu Asp Gin Met Phe Arg Ala Leu Phe 

10 180 185 190 



15 



45 



gcc atg gcc aag ctg ccc age gag atg age cca gac ctg cac cgc aag 
Ala Met Ala Lys Leu Pro Ser Glu Met Ser Pro Asp Leu His Arg Lys 
195 200 205 



gac gcc ttt ggc ate gtg gca ccc ttc gcg cgc egg ccc age gta cac 
25 Asp Ala Phe Gly He Val Ala Pro Phe Ala Arg Arg Pro Ser Val His 
245 250 255 

age gac etc atg tac ate cgc gag aag gcc age gcc ggc age cag gcc 
Ser Asp Leu Met Tyr He Arg Glu Lys Ala Ser Ala Gly Ser Gin Ala 
30 260 265 270 

aag gac aag gag cgc tgc gtc ate age tag 
Lys Asp Lys Glu Arg Cys Val He Ser 
275 280 

35 

<210> 2 
<211> 281 
<212> PRT 
40 <213> Homo sapiens 

<400> 2 

Met Lys Leu Ala Ala Met He Lys Lys Met Cys Pro Ser Asp Ser Glu 
15 10 15 

Leu Ser He Pro Ala Lys Asn Cys Tyr Arg Met Val He Leu Gly Ser 
20 25 30 



Ser Lys Val Gly Lys Thr Ala He Val Ser Arg Phe Leu Thr Gly Arg 
50 35 40 45 

Phe Glu Asp Ala Tyr Thr Pro Thr He Glu Asp Phe His Arg Lys Phe 
50 55 60 

55 Tyr Ser He Arg Gly Glu Val Tyr Gin Leu Asp He Leu Asp Thr Ser 
65 70 75 80 

Gly Asn His Pro Phe Pro Ala Met Arg Arg Leu Ser He Leu Thr Gly 
85 90 95 

60 

Asp Val Phe He Leu Val Phe Ser Leu Asp Asn Arg Asp Ser Phe Glu 
100 105 110 



624 



gtc teg gtg cag tac tgc gac gtg ctg cac aag aag gcg ctg egg aac 672 

Val Ser Val Gin Tyr Cys Asp Val Leu His Lys Lys Ala Leu Arg Asn 
210 215 220 

20 aag aag ctg ctg egg gcc ggc age ggc ggc ggc ggc ggc gac ccg ggc 720 

Lys Lys Leu Leu Arg Ala Gly Ser Gly Gly Gly Gly Gly Asp Pro Gly 
225 230 235 240 



768 



816 



846 
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«u V.! cin Arg , eu Arg cln ; ne Leu Asp ^ 

1Z0 125 
5 Lys Thr Lys Glu val ^ ne ^ 

1J5 140 
Asn Lys My Asp Arg phe v ^ ^ 

10 150 1S5 16() 

Glu C ln L eu Val Gly Asp Asp Pro Arg Cys ^ ^ ^ 

170 

is S« Ma Lys J,. Asn ser ser flin ^ ^ J ^ 

185 190 
-a He, ;; . Lys Leu Pro ser l ^ ^ ^ 

'0 V* Ser v.! 01n Tyr Cys val ^ Aia l ^ a ^ 

* ib 220 

225 ^ Leu Arg ^ *r «y ciy oi y Gly Gly Asp Pro Giy 

5 235 

Asp Ala Phe Gly III val A1 * -° »»• - pro ser val r 

^ U 255 
o Ser ASP Leu Tyr ne Arg fllu ^ au ^ 



255 

Ala Ser Ala Gly Ser 
265 27Q 

Lys Asp Lys Glu Arg Cys Val lie Se^ 
275 



280 



<210> 3 

<211> 1801 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (154) . . (996) 

<400> 3 



g ,aa ttC c ga gcggagccgg agccccaagc ^ 
cca gccg c tc accccgcgtg ccaccccagc ^^^^ ^^^^^ cctcctct ^ ^ 
9 c C c cgcgcc cgccctcgcg gcccctctgc _ ^ ^ ^ ^ 

Met Lys Leu Ala Ala Met lie 

"* ~ £ s x z; s a a s s - s 2 ~ - 

15 20 
tgc tat cgc atg gtc ate etc gg C tea tec aaa ~t- 

Cys Tyr Arg Met Val He Leu Glv w e f 9 5 9gC 339 3Cg gcc 27 ° 
25 ^ Gly Ser Ser L VS Val Gly Lys Thr Ala 

JU 35 

«c gC9 teg cgc ttc ctc acc ggc cgc ^ gac gcc ^ ^ ^ 
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He Val Ser Arg Phe Leu Thr Gly Arg Phe Glu Asp Ala Tyr Thr Pro 
40 45 50 55 

acc ate gag gac ttc cac cgc aag ttc tac tec ate cgc ggc gag gtc 
5 Thr He Glu Asp Phe His Arg Lys Phe Tyr Ser He Arg Gly Glu Val 
60 65 70 

tac cag etc gac ate etc gac acg tec ggc aac cac ccg ttc ccc gec 
Tyr Gin Leu Asp He Leu Asp Thr Ser Gly Asn His Pro Phe Pro Ala 
10 75 80 85 

atg egg cgc etc tec ate etc aca gga gac gtt ttc ate ctg gtg ttc 
Met Arg Arg Leu Ser He Leu Thr Gly Asp Val Phe lie Leu Val Phe 
90 95 100 



15 



20 cag ate etc gac acc aag tct tgc etc aag aac aaa acc aag gag aac 
Gin He Leu Asp Thr Lys Ser Cys Leu Lys Asn Lys Thr Lys Glu Asn 
120 125 130 135 

gtg gac gtg ccc ctg gtc ate tgc ggc aac aag ggt gac cgc gac ttc 
2 5 Val Asp Val Pro Leu Val lie Cys Gly Asn Lys Gly Asp Arg Asp Phe 
140 145 150 

tac cgc gag gtg gac cag cgc gag ate gag cag ctg gtg ggc gac gac 
Tyr Arg Glu Val Asp Gin Arg Glu He Glu Gin Leu Val Gly Asp Asp 
30 155 160 165 

ccc cag cgc tgc gee tac ttc gag ate teg gee aag aag aac age age 

Pro Gin Arg Cys Ala Tyr Phe Glu lie Ser Ala Lys Lys Asn Ser Ser 

170 175 180 

35 

ctg gac cag atg ttc cgc gcg etc ttc gec atg gee aag ctg ccc age 

Leu Asp Gin Met Phe Arg Ala Leu Phe Ala Met Ala Lys Leu Pro Ser 

185 190 195 

4 0 gag atg age cca gac ctg cac cgc aag gtc teg gtg cag tac tgc gac 
Glu Met Ser Pro Asp Leu His Arg Lys Val Ser Val Gin Tyr Cys Asp 
200 205 210 215 

gtg ctg cac aag aag gcg ctg egg aac aag aag ctg ctg egg gee ggc 
4 5 Val Leu His Lys Lys Ala Leu Arg Asn Lys Lys Leu Leu Arg Ala Gly 
220 225 230 

age ggc ggc ggc ggc ggc gac ccg ggc gac gec ttt ggc ate gtg gca 
Ser Gly Gly Gly Gly Gly Asp Pro Gly Asp Ala Phe Gly He Val Ala 
50 235 240 245 

ccc ttc gcg cgc egg ccc age gta cac age gac etc atg tac ate cgc 

Pro Phe Ala Arg Arg Pro Ser Val His Ser Asp Leu Met Tyr lie Arg 

250 255 260 

55 



366 



414 



462 



agt ctg gac aac cgc gac tec ttc gag gag gtg cag egg etc agg cag 510 
Ser Leu Asp Asn Arg Asp Ser Phe Glu Glu Val Gin Arg Leu Arg Gin 
105 HO H5 



558 



606 



654 



702 



750 



79B 



846 



894 



942 



990 



gag aag gee age gec ggc age cag gee aag gac aag gag cgc tgc gtc 
Glu Lys Ala Ser Ala Gly Ser Gin Ala Lys Asp Lys Glu Arg Cys Val 
265 270 275 

60 ate age taggagcccc gccgcgctgg cgacacaacc taaggaggac ctttttgtta 1046 
He Ser 
280 



15 



20 



WO 99/58670 

PC77US99/I0I5I 

•~ tw «„ 9 3 „ sag , 5c5 nt 
, — — „ _ 

« 59g „„ 5 ^ s _ 

attgaggccc gccaccccca taa ^^f 

9 ~- , M . 

.— „ 3 ,„ cw ^ - 

~ t 3tJ ,_ s se , g . s „ [c 

aggaacacaa aacaaaactt: gcactttaan 

gcactttaac ggtagtcccg acqtcaar.h 

a .i-ytcaacat ggacacgaac i^rc 

— t _ „„,„ 5tt ; 5 

.,1 — — — — .« 

~ "*""'" — — <« - 

a t cta gaggg ccgcatcaCg taatcagtca 



1801 



30 

<210> 4 
<2ll> 34 
<212> DNA 



35 llll* * rtific "l sequence 
35 <223> Probe/Primer 



<400> 4 

"catggagc tcaaaccgtt ac t a ttaggt gccg 

40 34 



<210> 5 
<211> 54 
<212> DNA 



45 Itll* ft:xf lciaa Eequ ence 
45 <223> ^obe/Primer 

<400> 5 

c tgctggtcg acgcggccgc tcar.f^t- 

9 tca tataata ccaatttttt tMggtcctg ctgg 

<210> 6 
<211> 28 
<212> DNA 

55 <223> Probe/Primer 
<400> 6 

gtttgacgtg gg tgctcagc ggtcCgag 
60 28 
<210> 7 
<2ll> 29 



WO 99/58670 



PCMJS99/10151 



- 6 - 

<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 

<400> 7 

ctcagaaccg ctgagcaccc acatcaaac 2 ^ 



<210> 8 
10 c211> 35 
<212> DNA 

<213=> Artificial sequence 
<223> Probe/Primer 

15 <400> 8 

cgggatccat gaaactggcc gcgatgatca agaag 



<210> 9 
20 <211> 28 
<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 

25 <400> 9 

ggaattccta gctgatgacg cagcgctc 



<210> 10 
30 <211> 25 
<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 

35 <400> 10 

ggatccatgc aaacgctaaa gtgtg 



<210> 11 
40 <211> 31 
<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 

45 <400> 11 

gaattcgact acaaaattgc acatttttta c 31 



<210> 12 
50 <211> 31 
<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 

55 <400> 12 

cgcatggtca tcctcgtttc gtccaaggtg g 31 



c210> 13 
60 <211> 31 
<212> DNA 

<213> Artificial sequence 



WO 99/58670 



PCTVUS99/1015I 



10 



<223> Probe/Primer 
<400> 13 

ccaccttgga cgaaacgagg atgaccatgc g 

<210> 14 
<211> 35 
<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 

<400> 14 

K gcccgtccaa gg t g gtCaag acggccatcg CgtC g 

<210> 15 
<211> 35 
<212> DNA 
>° <213> Artificial sequence 
<223> Probe/Primer 

<400> 15 

s cgacacgatg gccgtctcaa ccaccttgga cgagc 

<210> 16 
<211> 32 
<212> DNA 
> <213> Artificial sequence 
<223> Probe/Primer 

<400> 16 

cctcgacacg tccgctaacc acccgttccc eg 

<210> 17 
<211> 32 
<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 

<400> 17 

eggggaaegg gtggttagcg gacgtgtcga 

<210> 18 
<211> 8 
<212> PRT 
<213> Unknown 



31 



35 



35 



32 



32 



acid; Xaa at position 8 may be 



<400> 18 

01/ Xaa Xaa Xaa Xaa Gly Lys Xaa 
1 5 



WO 99/58670 



PCT7US99/I0151 



8 



<212> PRT 
<213> Unknown 

<220> Xaa's at positions 2,3 may be any amino acid 

5 

<400> 19 
Asp Xaa Xaa Gly 
1 



10 



15 



<210> 20 

<211> 4 

<212> PRT 

<213> Unknown 



<220> Xaa at position 3 may be any amino acid 



<400> 20 
Asn Lys Xaa Asp 
20 1 



<210> 21 

<211> 5 

25 <212> PRT 

<213> Unknown 



<220=> Xaa at position 2 may be any amino acid 



30 <400> 21 

Glu Xaa Ser Ala Lys 
1 5 



35 <210> 22 
<211> 4 
<212> PRT 
<213> Unknown 



40 <220> Xaa at position 2 or 3 may be any aliphatic amino acid residue 
<220> Xaa at position 4 may be any amino acid 



<400> 22 
Cys Xaa Xaa Xaa 
45 1 



<210> 23 
<211> 7 
50 <212> PRT 

<213> Homo sapiens 

<400> 23 

Gin Ala Lys Asp Lys Glu Arg 
55 1 5 



<210> 24 
<211> 1691 
6 0 <212> DNA 

<213> Homo sapiens 



WO 99/58670 

PCT/US99/10151 

- 9 - 

<220> 

<221> CDS 

<222> <45) . . f587) 



<400> 24 



- 3aagttg tactcaaagc 3gaggagcta agccacctgc t ^ ^ ^ ^ 

Met Cys Lys Gly 
1 

10 s: e e s e e l- £ s e ^ e s is e e '« 

>. ~ If: s ~ ~ - - i "J e e s e* e 

£ e zi L- e e 2 e A <* r r «* J .„ « c 

55 ^ Trp Ar 9 As P S ^r Leu Asp Ly S Leu 

25 60 65 

ctg cag aac aac tat gga ctt qcc aat 

u. «. «„ ^ Gly - £ - S «. «. «. «« 

80 

10 e e s e e z e a e 5 e e e e r - 

H5 90 F xxe AIa Cys Glu Asp Tyr 

95 100 

■ E E Z E E E E E E E 5! E E E E 

110 115 

, E 1" E E E E E E E E E E S E E E 
- »i - E E E S - E E ST E E E E 

140 145 

Ser Ser £ Isp Tel III ^ £ £ £ £ ^ "* -9 "6 

150 ys Ile Hl s Ala Leu Met Glu Lys 

b 160 

gat tct ctg cct cgc ttt gt 

S Ser Leu Pro Arg , V J - £ ^ - £ ca 3 ^ t 584 

175 

"i "'"""* "<™«" t «„ gtt , t tj7 

«— * „ eet , c , t , tetMtMe gcaKtttsc m 
<-».**. 39g , tgtt9C „. ctctett ejcc _ >ot ^ 
« ,^.. S30 « t « tec « „„ c „ ttc eecct9ocet ^ 



200 



243 



296 



344 



392 



440 



488 



WO 99/58670 PCT/US99/10151 
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aagatatctt cagcttgtag tcagacccaa tcagaatcac agaaaaatcc tgcctaaggc 877 
aaagaaatat aagacaagac tatgatatca atgaatgtgg gttaagtaat agatttccag 937 
5 ctaaattggt ctaaaaaaga atattaagtg tggacagacc tatttcaaag gagcttaatt 997 
gatctcactt gttttagttc tgatccaggg agaccacccc tctaattatt tctgaacttg 1057 
gttaataaaa gtttataaga tttttatgaa gcagccactg tatgatattt taagcaaaca 1117 

10 

tgttatttaa aatattgatc cttcccttgg accaccctca tgttagttgg gtattataaa 1177 
taagagatac aaccatgaat atattatgtt cacacaaaat caatctgaac acaattcata 1237 
15 aagatttctc ttttatacct tcctcactgg ccccctccac crgcccatag tcaccaaatt 1297 
ccgccttaaa tcaatgacct aagatcaaca atgaagtatt ttataaatgt atttatgctg 1357 
ctagactgtg ggccaaatgt ctccattttc aaactatcta gaatccctat gagtctaaaa 1417 

20 

tttgcaaatt tctaaatcca atcatgtaaa acgaaactgt tgctccattg gagtagtctc 1477 
ccacccaaat accaagatgg ctacatgcta aaaagagaaa acatggtcaa gtctaaaatg 1537 
25 gctaattgtc ctatgatgct attatcatag actaatgaca tttaccttca aaacaccaaa 1597 
ttgtctttag aaaaactaat gtgattacag gtagaggcct tctaggtgag acacttttaa 1657 
ggtacactgc attttgcaaa aaaaaaaaaa aaaa 1691 

30 

<210> 25 
<211> 181 
<212> PRT 
35 <213> Homo sapiens 

c400> 25 

Met Cys Lys Gly Leu Ala Ala Leu Pro His Ser Cys Leu Glu Arg Ala 
15 10 15 



40 



Lys Glu lie Lys He Lys Leu Gly He Leu Leu Gin Lys Pro Asp Ser 
20 25 30 



Val Gly Asp Leu Val He Pro Tyr Asn Glu Lys Pro Glu Lys Pro Ala 
45 35 40 45 

Lys Thr Gin Lys Thr Ser Leu Asp Glu Ala Leu Gin Trp Arg Asp Ser 
50 55 60 

50 Leu Asp Lys Leu Leu Gin Asn Asn Tyr Gly Leu Ala Ser Phe Lys Ser 
65 70 75 80 

Phe Leu Lys Ser Glu Phe Ser Glu Glu Asn Leu Glu Phe Trp lie Ala 
85 90 95 

55 

Cys Glu Asp Tyr Lys Lys He Lys Ser Pro Ala Lys Met Ala Glu Lys 
100 105 HO 

Ala Lys Gin He Tyr Glu Glu Phe He Gin Thr Glu Ala Pro Lys Glu 
60 115 120 125 



Val Asn He Asp His Phe Thr Lys Asp He Thr Met Lys Asn Leu Val 



WO 99/58670 
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130 

«« P» Ser Leu Ser 3 phe Asp ^ ue 

5 155 

- Hec Olu ,ys j.p Ser Leu Pro Arg phe ^ g "° 

170 

a/U 175 

Gin Glu Leu lie Lys 
10 180 



<210:> 26 
<2ll> 27 
15 <212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 

<400> 26 

20 ccagatctaa agatgccgat ttgggcg 

27 

<=210> 27 
<211> 32 
25 <212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 



0 



<400> 27 

ccccatggtt ttatatttgt tgtaaaaagt ag 



32 



<210> 28 
<211> 30 
5 <212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 

<400> 28 

cgggatccat gtgcaaaggg cttgcaggtc 

<210> 29 
<2ll> 28 
<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 

<400> 29 

ccgctcgagt taggcacact gagg gacc 

<210> 30 
*<211> 41 
<212> DNA 

<213> Homo sapiens 
<400> 30 

agtcggtacc cgcacagatc tgcaggatgc cc CCCttg a c g 



30 



28 



WO 99/58670 
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- 12 - 

<211> 34 
<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 

<400> 31 

gtacgtcgac tttgattttc agaaacttga tggc 34 



10 <210> 32 
<211> 32 
<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 



15 



25 



35 



45 



55 



<400> 32 

tggcctcgag atgacaaatt caaaagaaga eg 32 



20 c210> 33 
<211> 32 
<212> DNA 

<213> Artificial sequence 
<220> Probe/Primer 



<400> 33 

ateactgeag etatgetaca acattccaaa at 32 



30 <210> 34 
<211> 32 
<212> DNA 

c213> Artificial sequence 
<223> Probe/Primer 



c400> 34 

gggtcatgaa actggccgcg atgatcaaga ag 32 



40 <210> 35 
<211> 31 
<212> DNA 

c213> Artificial sequence 
<223> Probe/Primer 



<400> 35 

gatagtcgac ctagctgatg acgcagcgct c 31 



50 <210> 36 
<211> 31 
<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 



<400> 36 

cgcatggtca tcctcgtttc gtccaaggtg g 31 



60 <210> 37 
<211> 31 
<212> DNA 



WO 99/58670 

PCT/US99/10151 

- 13 - 

<213> Artificial sequence 
<223> Probe/Primer 



5 



<400> 37 

ccaccttgga cgaaacgagg atgaccatgc g 



31 



<210> 38 
<211> 32 
!0 <212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 



15 



<400> 38 

ccaaggacaa ggagcgcagc gtcatcagct ag 



32 



<210> 39 
<211> 32 
20 <212> DNA 

<213> Artificial sequenc 
<223> Probe/Primer 



25 



<400> 39 
ctagctgatg acgctgcgct ccttgt 



cctt gg 

32 



c210> 40 
<211> 837 
30 <212> DNA 

<213> Homo sapiens 

<220> 
<22l> CDS 
35 <222> (i) . . {834) 

<400> 40 



10 



Met Pro Ala Ser Leu lit Leu Gil p" T 9M " 9 " 9 *** 

, ^ Leu Gin Pro Arg Ala Met Met Lys Thr 

10 15 
ttg tec age ggg aac tgc acq etc aof e,r„ 

U SSr S « «y Asn Cy S Th * = u HI «- aac tea tae 

20 ro Ala h Y s Asn Ser Tyr 

25 30 



} 



S S S 5? 2 sj S 2 If, SJ S - - S ~ ffi 

40 45 
tct cgc etc etc aat g qc car C ^^ „ a „ 

^r Arg Phe Leu Asn ^ ^ gag gac cag tac aca ccc acc ate 
50 Y A f? Phe Glu As P Gin Tyr Thr Pro Thr lie 

b5 60 

Olu IS til - HI £ £ £ - - «*c 93 c gac atg tae cag 
6S Y Val T ^ r Asn He Arg Gly As p Met Tyr Gin 

75 

= 5 S 2 K S L« £ « s S - - ~ ^ ^ 



96 



144 



240 



288 



336 
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Arg Leu Ser lie Leu Thr Gly Asp Val Phe lie Leu Val Phe Ser Leu 
100 105 HO 

gat aac egg gag tec ttc gat gag gtc aag cgc ctt cag aag cag ate 3S4 
5 Asp Asn Arg Glu Ser Phe Asp Glu Val Lys Arg Leu Gin Lys Gin lie 
115 120 125 

ctg gag gtc aag tec tgc ctg aag aac aag acc aag gag gcg gcg gag 4 32 
Leu Glu Val Lys Ser Cys Leu Lys Asn Lys Thr Lys Glu Ala Ala Glu 
10 130 135 140 

ctg ccc atg gtc ate tgt ggc aac aag aac gac cac ggc gag ctg tgc 4 80 
Leu Pro Met Val He Cys Gly Asn Lys Asn Asp His Gly Glu Leu Cys 
145 150 155 160 

cgc cag gtg ccc acc acc gag gec gag ctg ctg gtg teg ggc gac gag 
Arg Gin Val Pro Thr Thr Glu Ala Glu Leu Leu Val Ser Gly Asp Glu 
165 170 175 

20 aac tgc gee tac ttc gag gtg teg gec aag aag aac acc aac gtg gac 576 
Asn Cys Ala Tyr Phe Glu Val Ser Ala Lys Lys Asn Thr Asn Val Asp 
180 185 190 

gag atg ttc tac gtg etc ttc age atg gee aag ctg cca cac gag atg 624 
25 Glu Met Phe Tyr Val Leu Phe Ser Met Ala Lys Leu Pro His Glu Met 
195 200 205 

age ccc gec ctg cat cgc aag ate tec gtg cag tac ggt gac gee ttc 672 
Ser Pro Ala Leu His Arg Lys He Ser Val Gin Tyr Gly Asp Ala Phe 
30 210 215 220 



15 



35 



40 



cac ccc agg ccc ttc tgc atg cgc cgc gtc aag gag atg gac gec tat 

His Pro Arg Pro Phe Cys Met Arg Arg Val Lys Glu Met Asp Ala Tyr 

225 230 235 240 

ggc atg gtc teg ccc ttc gee cgc cgc ccc age gtc aac agt gac etc 

Gly Met Val Ser Pro Phe Ala Arg Arg Pro Ser Val Asn Ser Asp Leu 

245 250 255 



gac aag tgc acc ate cag tga 
4 5 Asp Lys Cys Thr He Gin 
275 



528 



720 



768 



aag tac ate aag gee aag gtc ctt egg gaa ggc cag gec cgt gag agg S16 
Lys Tyr He Lys Ala Lys Val Leu Arg Glu Gly Gin Ala Arg Glu Arg 
260 265 270 



837 



<210:> 41 
50 <211> 278 
<212> PRT 

<213> Homo sapiens 
<400> 41 

55 Met Pro Ala Ser Leu Ala Leu Leu Gin Pro Arg Ala" Met Met Lys Thr 
15 10 15 

Leu Ser Ser Gly Asn Cys Thr Leu Ser Val Pro Ala Lys Asn Ser Tyr 
20 25 30 



60 



Arg Met Val Val Leu Gly Ala Ser Arg Val Gly Lys Ser Ser He Val 
35 40 45 



WO 99/58670 
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S« Arg Phe Leu Asn 01y Arg Phe 01u Asp Gln Tyr Thr ^ ^ n 



55 - 'i- inr lie 

* -p Phe „ is Arg Lys Val Tyr Asn ne Arg Q1 6 y ° 

75 

80 

Leu Asp He Leu Asp Thr Ser n„ ft 

q P Ser Gly Asn Hxs Pro Phe Pro Ala Met Arg 

10 90 55 y 



105 110 

15 A-p Asn s 01u ser Phe Asp ^ u val Lys Arg Leu ^ Lys ^ ^ 

- «» val Lys ser cys Lys ^ ^ 2 Ma Ma 

S 140 

20 Leu Pro Met Val ll e Cvs n v a , 

150 V ^ AE " HiS G1 V Cys 

Arg Gin Val Pro Thr Thr Glu Ala n„ , , 
25 165 3 G1U £jj Leu Val Ser Gly Asp Glu 

Asn Cys Ala Tyr Phe Glu 

180 *** **» L V= AS" Thr Asn Val Asp 

185 190 

>• °'" "" 5 *' L ~ - ;« - «• u. u . M . M „ 

205 

- S »• u. „. » t3 ^ „. s „ VI1 01 „ ^ Qiy 

220 



5 S ^ ^ ^ "< - g val hys Glu Met 

JU 235 

Gly Met Val Ser Pro Phe ai, b 

jo Phe Ala Arg Arg Pro Ser Val Asn 



250 Ser Asp Leu 

* ^ a. jj M . , v , W1 L „„ 01u „ ly M „ m> *" 

265 270 

Asp Lys Cys Thr lie Gin 
275 



<210> 42 
<211> 15 
<212> PRT 

<213> Homo sapiens 



c400> 42 

Asp Thr Lys ser Cys Leu Lys Asn 



<210> 43 
<2ll> 19 
<212> DNA 

<213> Artificial sequence 
<223> Probe/Primer 



Lys Thr Lys Glu Asn Val Asd 
10 15 



WO 99/58670 
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<400> 43 

ttctcgcgga tgtacatga 



<210> 44 

<211> 20 

<212> DNA 

<213> Artificial sequence 

10 <223> Probe/Primer 

<400> 44 

tccaccgcaa gttctactcc 



19 



15 



20 



45 



20 



<210> 45 

<211> 1740 

<212> DNA 

<213> Homo sapiens 

<220> 

<221=> CDS 

<222> (146) . . (988) 



25 <400> 45 

gagcggagcc ggagccccaa gcccgagccg cgcccagccc gagcagagcc ctccagccgc 6 0 
tcaccccgcg cgccacccca gcgaccctca gccgctctct gcccttctct cggccccgcg 120 



30 cccgccctcg cggcccctct gccca atg aaa ctg gcc gcg atg ate aag aag 

Met Lys Leu Ala Ala Met He Lys Lys 
1 ' 5 

atg tgc ccg age gac teg gag ctg agt: ate ccg gcc aag aac tgc tat 
35 Met Cys Pro Ser Asp Ser Glu Leu Ser He Pro Ala Lys Asn Cys Tyr 
10 15 20 25 

cgc atg gtc ate etc ggc teg tec aag gtg ggc aag acg gcc ate gtg 
Arg Met Val He Leu Gly Ser Ser Lys Val Gly Lys Thr Ala He Val 
40 30 35 40 

teg cgc ttc etc acc ggc cgc ttc gag gac gcc tac acg cct acc ate 
Ser Arg Phe Leu Thr Gly Arg Phe Glu Asp Ala Tyr Thr Pro Thr He 
45 50 55 • 



172 



220 



268 



316 



gag gac ttc cac cgc aag ttc tac tec ate cgc ggc gag gtc tac cag 364 
Glu Asp Phe His Arg Lys Phe Tyr Ser He Arg Gly Glu Val Tyr Gin 
60 65 70 



50 etc gac ate etc gac acg tec ggc aac cac ccg ttc ccc gcc atg egg 

Leu Asp He Leu Asp Thr Ser Gly Asn His Pro Phe Pro Ala Met Arg 

75 80 85 

cgc etc tec ate etc aca gga gac gtt ttc ate ctg gtg ttc agt ctg 

55 Arg Leu Ser He Leu Thr Gly Asp Val Phe He Leu Val Phe Ser Leu 

90 95 100 105 

gac aac cgc gac tec ttc gag gag gtg cag egg etc agg cag cag ate 

Asp Asn Arg Asp Ser Phe Glu Glu Val Gin Arg Leu Arg Gin Gin lie 

60 110 H5 120 

etc gac acc aag tct tgc etc aag aac aaa acc aag gag aac gtg gac 



412 



460 



508 



556 



WO 99/58670 
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- - ~ S„ c y , ^ Ly , ^ Ly , , hr Ly , ffl> ^ ^ 

5 S S S S; " : * ~ E- - S5 5 S 5 £ - - - 

150 

9 a 9 gtg gac caq cor nan at- 

Glu Val Asp Gin Ar g Glu lie Glu ," 9 ^ " C 93 c 3«= «c cag S52 

10 155 Hi Glu Gin Leu Val Gly Asp Asp Pro Gin 

165 

i * E S S K S 2 E - •« « » s 2 „ „. 

180 185 

cag atg ttc cgc gcq etc i-i-r* ~ 

01- «e t Phe Arg ** Leu «" |« -g g c agc 

190 a Lys Leu p ^o Ser Glu Met 

20 200 

~ s - - 2 53 s - 5; S Sf s - 

» 3 K 2 S - 3 K 2 S s e i 2 5 «« 

245 

. S s s s = z " : E - - « £ s s a - - 

" = «- - s s e s ~ ~ ~ * ~ ;s 



275 

280 



« taggagcccc 3 ccgc g c C gg cgacacaacc taaggaggac cttcttctt 

y» ggac ctttttgtta agtcaaatcc 1048 

™ — , 3 _ _ m ^ 
<5 — _ s „ rogw 99icsgcc>t ^ 

•»... MM « SM „„ g . ctfflj , cti eecc 

9».c««, «" 3 , 33 « = 
WMMCO c , g „ wsj tKitemc ^ 

~ -e^, t3 , 393t „ 3 g _ 
„ , m stwc[c 

~ — "~ ~. w . 

~ t ss[ , gle „ 3 

— «~~, ^ ttMmett 
™ -™ „,« 93 „, _ _ 
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aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1740 



5 <210> 46 
<211> 281 
<212> PRT 

<213> Homo sapiens 
10 <400> 46 

Met Lys Leu Ala Ala Met He Lys Lys Met Cys Pro Ser Asp Ser Glu 
15 10 15 

Leu Ser He Pro Ala Lys Asn Cys Tyr Arg Met Val He Leu Gly Ser 
15 20 25 30 

Ser Lys Val Gly Lys Thr Ala He Val Ser Arg Phe Leu Thr Gly Arg 
35 40 45 

20 Phe Glu Asp Ala Tyr Thr Pro Thr He Glu Asp Phe His Arg Lys Phe 
50 55 60 

Tyr Ser lie Arg Gly Glu Val Tyr Gin Leu Asp He Leu Asp Thr Ser 
65 70 75 80 

25 

Gly Asn His Pro Phe Pro Ala Met Arg Arg Leu Ser He Leu Thr Gly 
85 90 95 

Asp Val Phe He Leu Val Phe Ser Leu Asp Asn Arg Asp Ser Phe Glu 
30 100 105 HO 

Glu Val Gin Arg Leu Arg Gin Gin lie Leu Asp Thr Lys Ser Cys Leu 
115 120 125 

3 5 Lys Asn Lys Thr Lys Glu Asn Val Asp Val Pro Leu Val He Cys Gly 
130 135 140 

Asn Lys Gly Asp Arg Asp Phe Tyr Arg Glu Val Asp Gin Arg Glu lie 
145 150 155 160 

40 

Glu Gin Leu Val Gly Asp Asp Pro Gin Arg Cys Ala Tyr Phe Glu lie 
165 170 175 

Ser Ala Lys Lys Asn Ser Ser Leu Asp Gin Met Phe Arg Ala Leu Phe 
45 180 185 190 

Ala Met Ala Lys Leu Pro Ser Glu Met Ser Pro Asp Leu His Arg Lys 
195 200 205 

50 Val Ser Val Gin Tyr Cys Asp Val Leu His Lys Lys Ala Leu Arg Asn 
210 215 220 

Lys Lys Leu Leu Arg Ala Gly Ser Gly Gly Gly Gly Gly Asp Pro Gly 
225 230 235 240 

55 

Asp Ala Phe Gly He Val Ala Pro Phe Ala Arg Arg Pro Ser Val His 
245 250 255 

Ser Asp Leu Met Tyr He Arg Glu Lys Ala Ser Ala Gly Ser Gin Ala 
60 260 265 270 



Lys Asp Lys Glu Arg Cys Val lie Ser 
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275 



280 



<210> 47 
<211> 189 
5 <212> PRT 

<213> Homo sapiens 

<400> 47 

10 



io 15 

Ser Ma Leu Thr He Gin Leu He Gln Asn His phg Val Asp ^ ^ 

23 30 

» asp Pro Thr „. Glu Asp ser Tyr Arg ^ ne 

40 45 
«« Tjr C ys Leu Leu Asp n Leu Asp Thr ^ My ^ ^ ^ 
20 5 60 

Ser Ala Met Arg Asp <n Tyr Met Arg Thr ^ ^ ^ ^ ^ 

75 

^ Val Phe Aia Ile A *„ As „ Thr Lys Ser ^ Qlu Asp ^ ^ ^ 

ciu cm ne Lys Arg val Lys ^ ser Asp Asp J' 

30 Leu v.l cjy As „ L yS C ys Asp Leu Ala Ala Arg Thr ^ ^ ^ ftrg 

120 125 

Gin Ala Gin Asp Leu Ala Arq Ser Tvr ri„ n n 

130 ?i? Tyr Ile p ^o Tyr lie Glu Thr 

35 XJb 140 

s- M . Lys Thr Arg Qly vai wu ^ ^ ^ ^ 

155 

^ 160 
4o Ar 3 alu ne A r 3 G1 His Lys Leu Arg Lys Leu Asn prQ prQ Aso 

170 175 
Ser Gly Pro G1 cys Mec Ser Cys ^ ^ 

180 185 

45 

<210> 48 
<211> 206 
<212> PRT 

<213> Homo sapiens 

50 

<400> 48 



io 15 

* Vai ne Met Val 01y Ser Gly Gly Val aly Lys ^ ^ ^ ^ ^ 

25 30 

60 40 45 

ser ryr Arg Lys Lys val Asp fliu v ^ ^ ^ e 



55 60 
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Asp lie Leu Asp Thr Ala Gly Gin Glu Asp Tyr Ala Ala He Arg Asp 

65 *70 75 80 

5 Asn Tyr Phe Arg Ser Gly Glu Gly Phe Leu Cys Val Phe Ser He Thr 

85 90 95 

Glu Met Glu Ser Phe Ala Ala Thr Ala Asp Phe Arg Glu Gin He Leu 

100 105 HO 



10 



Arg Val Lys Glu Asp Glu Asn Val Pro Phe Leu Leu Val Gly Asn Lys 
115 120 125 



Ser Asp Leu Glu Asp Lys Arg Gin Val Ser Val Glu Glu Ala Lys Asn 

15 130 135 140 

Arg Ala Glu Gin Trp Asn Val Asn Tyr Val Glu Thr Ser Ala Lys Thr 

145 150 155 160 

20 Arg Ala Asn Val Asp Lys Val Phe Phe Asp Leu Met Arg Glu He Arg 

165 170 175 

Ala Arg Lys Met Glu Asp Ser Lys Glu Lys Asn Gly Lys Lys Lys Arg 

180 * 185 190 



25 



Lys Ser Leu Ala Lys Arg He Arg Glu Arg Cys Cys lie Leu 
195 200 205 



30 <210> 49 
<211> 205 
<212> PRT 
<213> Homo sapiens 

35 <400> 49 

Met Ser Ser Met Asn Pro Glu Tyr Asp Tyr Leu Phe Lys Leu Leu Leu 
15 10 15 

He Gly Asp Ser Gly Val Gly Lys Ser Cys Leu Leu Leu Arg Phe Ala 
40 20 25 30 

Asp Asp Thr Tyr Thr Glu Ser Tyr He Ser Thr He Gly Val Asp Phe 
35 40 45 

45 Lys He Arg Thr He Glu Leu Asp Gly Lys Thr He Lys Leu Gin He 
50 55 60 

Trp Asp Thr Ala Gly Gin Glu Arg Phe Arg Thr He Thr Ser Ser Tyr 
65 70 75 80 

50 

Tyr Arg Gly Ala His Gly He He Val Val Tyr Asp Val Thr Asp Gin 
85 90 95 

Glu Ser Phe Asn Asn Val Lys Gin Trp Leu Gin Glu He Asp Arg Tyr 
55 100 105 HO 

Ala Ser Glu Asn Val Asn Lys Leu Leu Val Gly Asn Lys Cys Asp Leu 
115 120 125 

60 Thr Thr Lys Lys Val Val Asp Tyr Thr Thr Ala Lys Glu Phe Ala Asp 
130 135 140 
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Ser Leu Gly n e Pro Phe Leu Glu Thr Ser A i fl r * 
145 er Ala L ^ s As " Ala Thr Asn 

155 

s val oiu Gin Ser Phe Met Thr Met A1 a Ala oiu lie Lys Lys Arg Z 

5 170 

I/u 175 

Oly Pro oiy jl. Thr Ala Gl y Gly Ala Glu Lys Sfir ^ ^ ^ ^ 

185 igo 
10 Gin ser Thr Pro Val Lys cln Ala Gly Gly 



200 205 



<210> 50 
15 <211> 210 
<212> PRT 

<213> Homo sapiens 



<400> 50 

20 Met Thr A la Ala Gin Ala Ala Cl y clu Clu A la Pro Pro G1 



Y Val Arg 
io 15 



Ser Val Lys v . ; Val Leu ^ Wy ^ ^ ^ ^ ^ ^ ^ ^ 

25 25 30 

Leu Met Val Phe A la Asp Giy Ala phe prQ Mu ^ ^ ^ ^ 

40 45 
3Q Thr val Phe Glu Arg T y r Met Val Asn Leu Gin Val Lys Gly Lys prQ 

bb 60 
Val His Leu „ is Ile ^ Agp Thr ftla sin Asp Asp Tyr ^ ^ 

35 Leu Arg Pro L eu Phe T yr Pro Asp Ala Ser Val Leu Leu Leu C ys Phe 



85 90 

yu 95 



Val Thr Ser Pro Asn Ser Phe Asp Asn Xle Phe Asn Ar g Trp Tyr 

105 110 



Glu val Asn „ ls Phe Cys Ly£ y-1 ^ ^ ^ ^ ^ 

120 125 
s C.S Lys Thr Asp Leu Arg Lys Asp Lys Ser Leu ^ ^ ^ ^ 

1J::> 140 
Arg Asn Gly Leu Glu Pro Val Thr Tvr * 

145 Vai rhr T ^ r Hls Arg Gly Gin Glu Met Ala 

155 

0 - g Ser Val aly A1 Val Ala Tyr ^ ^ ^ ^ ^ ^ ™ 

170 

ASP Asn Val His Ala Val Phe 01n Glu Ala Ala Mu Val ^ ^ 

185 190 
Ser Ar g oiy Ar g Asn Phe Trp Ar g A rg xle Thr Gin Gl y Phe Cys Val 



200 205 



Val Thr 
210 
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<210> 51 
<211> 191 
<212> PRT 

<213> Homo sapiens 

5 

<400> 51 

Met Gin Thr lie Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 
15 10 15 

10 Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Lys Phe Pro Ser Glu Tyr 
20 25 30 

Val Pro Thr Val Phe Asp Asn Tyr Ala Val Thr Val Met He Gly Gly 
35 40 45 

15 

Glu Pro Tyr Thr Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr 
50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu Val 
20 65 70 75 80 

Cys Phe Ser Val Val Ser Pro Ser Ser Phe Glu Asn Val Lys Glu Lys 
85 90 95 

25 Trp Val Pro Glu He Thr His His Cys Pro Lys Thr Pro Phe Leu Leu 
100 105 110 

Val Gly Thr Gin He Asp Leu Arg Asp Asp Pro Ser Thr He Glu Lys 
115 120 125 

30 

Leu Ala Lys Asn Lys Gin Lys Pro He Thr Pro Glu Thr Ala Glu Lys 
130 135 140 

Leu Ala Arg Asp Leu Lys Ala Val Lys Tyr Val Glu Cys Ser Ala Leu 
35 145 150 155 ISO 

Thr Gin Arg Gly Leu Lys Asn Val Phe Asp Glu Ala He Leu Ala Ala 
165 170 175 

40 Leu Glu Pro Pro Glu Thr Gin Pro Lys Arg Lys Cys Cys He Phe 
180 185 190 



<210> 52 
45 <211> 192 
<212> PRT 
<213> Homo sapiens 

<400> 52 

50 Met Gin Ala He Lys Cys Val Val Val Gly Asp Gly Ala Val Gly Lys 
15 10 15 

Thr Cys Leu Leu He Ser Tyr Thr Thr Asn Ala Phe Pro Gly Glu Tyr 
20 25 30 

55 

He Pro Thr Val Phe Asp Asn Tyr Ser Ala Asn Val Met Val Asp Ser 
35 40 45 

Lys Pro Val Asn Leu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr 
60 50 55 60 

Asp Arg Leu Arg Pro Leu Ser Tyr Pro Gin Thr Asp Val Phe Leu He 
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65 70 '5 

cys Phe s er Leu v.i Ser Pro Ala sgr Tyr Giu J° 

5 90 95 

Trp Phe Pro oiu Va! Arg „ is His Cys prQ Ser ^ ^ ^ ^ ^ 

105 110 
iQ Vai Gly Thr Ly s Leu Asp Leu Arg Asp Asp Lys Asp rhr ^ ^ ^ 

120 125 
Leu Lys 01u Lys Lys Leu pro ne ^ flin ^ ^ 

135 140 

IS Leu Ala Lys aiu Ue As p Ser v .l Lys Tyr Leu Glu Cys Ser ^ ^ 

160 

Thr Cln Arg Gly ^ Lys Thr val ^ Gi(j au ne ^ A ^ v ^ 

20 170 175 

*u c ys Pro ;1 Pro Thr Arg Qln ^ a ^ au ^ ^ ^ 



185 190 



25 



<210> 53 
<211> 181 
30 <212> PRT 

<213> Homo sapiens 

^ Me, G l" Gly Phe phe ser ser ne ^ ^ ^ ^ 

10 15 



«« Me, Ar g Ue Leu ne Leu Gly Leu As p 01y Ala 01y Lys Thr ?hr 

25 30 
<° He Leu Tyr Arg Leu Gln Val My ^ ^ ^ ^ ^ ^ ^ 

40 45 

He Gly Phe Asn Val Glu Thr Val Th>- t, 

50 „ al Thr Tyr Lys Asn Leu Lys Phe Gin 

45 bb 60 

Val Trp Asp Leu Gly Gly Gin Thr c<= r T i« a 
65 7 q ihr Ser Ile Ar 9 p ^o Tyr Trp Arg Cys 

75 80 

o Tyr Ser Asn Th r Asp Ala val ne ^ val ^ 

Asp Arg Ue 01y ne Ser Lys Ser Glu Leu ^ ^ ^ ^ ^ 

105 110 
S «u Olu Glu Leu Ar g L ys Ala Ile Leu val Val phe Ma Asn ^ 

120 125 

-P JJJ 01u Gln Ala Met Thr Ser Ser Glu Met Ala Asn ser Leu G 

1Ji 140 
£ Aia Leu Lys ^ ftrg Lys ^ ^ ^ ^ t ^ 



155 160 
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Thr Lys Gly Thr Gly Leu Asp Glu Ala Met Glu Trp Leu Val Glu Thr 
165 170 175 

5 Leu Lys Ser Arg Gin 
180 



<210> 54 
10 <211> 229 
<212> PRT 

<213> Homo sapiens 
<400> 54 

15 Met Asp Pro Asn Gin Asn Val Lys Cys Lys lie Val Val Val Gly Asp 
15 10 15 

Ser Gin Cys Gly Lys Thr Ala Leu Leu His Val Phe Ala Lys Asp Cys 
20 25 30 

20 

Phe Pro Glu Asn Tyr Val Pro Thr Val Phe Glu Asn Tyr Thr Ala Ser 
35 40 45 

Phe Glu lie Asp Thr Gin Arg lie Glu Leu Ser Leu Trp Asp Thr Ser- 
25 50 55 60 

Gly Ser Pro Tyr Tyr Asp Asn Val Arg Pro Leu Ser Tyr Pro Asp Ser 
65 70 75 80 

30 Asp Ala Val Leu He Cys Phe Asp He Ser Arg Pro Glu Thr Leu Asp 
85 90 95 

Ser Val Leu Lys Lys Trp Lys Gly Glu He Gin Glu Phe Cys Pro Asn 
100 105 HO 

35 

Thr Lys Met Leu Leu Val Gly Cys Lys Ser Asp Leu Arg Thr Asp .Val 
115 120 125 

Ser Thr Leu Val Glu Leu Ser Asn His Arg Gin Thr Pro Val Ser Tyr 
40 130 135 140 

Asp Gin Gly Ala Asn Met Ala Lys Gin He Gly Ala Ala Thr Tyr He 
145 150 155 160 

45 Glu Cys Ser Ala Leu Gin Ser Glu Asn Ser Val Arg Asp He Phe His 
165 170 175 

Val Ala Thr Leu Ala Cys Val Asn Lys Thr Asn Lys Asn Val Lys Arg 
180 185 190 

50 

Asn Lys Ser Gin Arg Ala Thr Lys Arg He Ser His Met Pro Ser Arg 
195 200 205 

Pro Glu Leu Ser Ala Val Ala Thr Asp Leu Arg Lys Asp Lys Ala Lys 
55 210 215 220 

Ser Cys Thr Val Met 
225 

60 

<210> 55 
<211> 16 
<212> PRT 
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<213> Homo sapiens 
<400> 55 

5 Ly, lie Val Val Val Gly Asp Ser Gln C ys GIy Lys Thr Ala Leu Leu 

10 15 



<210> 56 
<211> 16 
10 <212> PRT 

<213> Homo sapiens 



<400> 56 



15 



^ Ile Val val VaJ Gly Asp Aia giu ^ ^ ^ 

10 15 



<210> 57 
<211> 16 
20 <212> PRT 

<213> Homo sapiens 



<400> 57 



25 



L ys l su v.i Leu va z Giy Asp val Gln cys Gly Lys rhr Ma ^ 

10 15 



<210> 58 
<211> 16 
30 <212> PRT 

<213> Homo sapiens 

<400> 58 



3s Lyj Leu VaX n e v.l « y Asp Gly Ala Cys Gly Lys Thr ^ ^ ^ 

10 15 



<210> 59 
<211> 16 
40 <212> PRT 

<213> Homo sapiens 

<400> 59 



^ ^ * u Val val V.1 Gly Asp Gly Aia Cys Wy Lys 

10 15 



<210> 60 
<211> 16 
50 <212> PRT 

<213> Homo sapiens 



55 



<400> 60 

L ys Cys Vai vai v .i Gly Asp Gly Ala val Giy Lys cys 

10 15 



<210> 61 
<211> 16 
60 <212> prt 

<213> Homo sapiens 



<400> 61 

ss L ys c ys v ai val Va i Gly Asp Gly Aia vai Giy Lys c ^ ^ 

io 15 
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<210> 62 
<211> 16 
<212> PRT 
5 <213> Homo sapiens 



<400> 62 

Lys Leu Val Val Val Gly Ala Gly Gly Val Gly Lys Ser Ala Leu Thr 
IS 10 15 



10 



<210> 63 
<211> 16 
<212> PRT 
15 <213> Homo sapiens 



<400> 63 

Arg Met Val He Leu Gly Ser Ser Lys Val Gly Lys Thr Ala He Val 
15 10 15 

20 



<210> 64 

<211> 13 

<212> PRT 

25 <213> Homo sapiens 



<400> 64 

Leu Ser Leu Trp Asp Thr Ser Gly Ser Pro Tyr Tyr Asp 
1 5 10 

30 



<210> 65 

<211> 13 

<212> PRT 

35 <213> Homo sapiens 



<400> 65 

Leu Asn Met Trp Asp Thr Ser Gly Ser Ser Tyr Tyr Asp 
1 5 10 

40 



<210> 66 
<211> 13 
<212> PRT 
45 <213> Homo sapiens 



<400> 66 

Leu Ser Leu Trp Asp Thr Ser Gly Ser Pro Tyr Tyr Asp 
15 10 

50 



<210> 67 

<211> 13 

<212> PRT 

55 <213> Homo sapiens 



<400> 67 

Leu Ala Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp 
1 5 10 

60 



<210> 68 
<211> 13 
<212> PRT 
65 <213> Homo sapiens 
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<400> 68 



^ Ala Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp 
5 10 



10 



15 



20 



<210> 69 
<211> 13 
<212> PRT 

<213> Homo sapiens 
<400> 69 

eu Phe Asp Thr Ala Gly Gin i 

10 



<400> 69 

Leu Gly Leu Phe Asp Thr Ala Gly Gin Glu Asp Tyr Asp 

5 i n 



<210> 70 
<211> 13 
<212> PRT 

<213> Homo sapiens 



<400> 70 

teu Gly Leu Trp Asp Thr Ala Gly Gin Glu Asp Tyr Asp 

J 10 



25 



30 



<210> 71 
<211> 13 
<212> prt 

<213> Homo sapiens 



<400> 71 

<-» Asp He Leu Asp Thr Ala Gly Gin Glu Glu Tyr Asp 



35 



40 



45 



<210> 72 
<211> 13 
<212> PRT 

<213> Homo sapiens 



<400> 72 

Asp lie Leu Asp Thr Ser GJy ^ ^ ^ ^ ^ 

* 10 



50 
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